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DESCRIPTION 

Development of Novel Anti-Microbial Agents Based on Bacteriophage Genomics 

RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Application No. 09/407,804, 
filed September 28, 1999, entitled DNA SEQUENCES FROM STAPYLOCOCCUS 
AUREUS BACTERIOPHAGE 77 THAT ENCODE ANTI-MICROBIAL 
POLYPEPTIDES, and claims the benefit of U.S. Provisional Application No. 
60/110,992, filed December 3, 1999, entitled DEVELOPMENT OF NOVEL 
ANTIMICROBIAL AGENTS BASED ON BACTERIOPHAGE GENOMICS, which 
are hereby incorporated by reference in their entireties, including drawings. 

BACKGROUND OF THE INVENTION 

The present invention relates to the field of antibacterial agents and the 
treatment of infections of animals or other complex organisms by bacteria. 

The frequency and spectrum of antibiotic-resistant infections have, in recent 
years, increased in both the hospital and community. Certain infections have become 
essentially unbeatable and are growing to epidemic proportions in the developing 
world as well as in institutional settings in the developed world. The staggering 
spread of antibiotic resistance in pathogenic bacteria has been attributed to microbial 
genetic characteristics, widespread use of antibiotic drugs, and changes in society that 
enhance the transmission of drug-resistant organisms. This spread of drug resistant 
microbes is leading to ever increasing morbidity, mortality and health-care costs. 

Ironically, it is the very success of antibiotics, resulting in their widespread 
use, that has contributed the most to rising numbers of drug resistant bacterial strains. 
The longer a bacterial strain is exposed to a drug, the more likely it is to acquire 
resistance. Today, a total of 160 antibiotics, all based on a few basic chemical 
structures and targeting a small number of metabolic pathways, have found their way 
to market. Over-prescription of these drugs, as well as the failure of patients to 
comply with the complete antibiotic regimen, has lead to the rapid emergence of 
antibiotic resistant strains. Such misuse of prescriptions, careless use of antibiotics in 
virtually all commercial production of beef and fowl, and changing societal 
conditions, such as the growth of day-care centers, increased long-term care in 
hospitals, and increased mobility of the population, has provided an environment 
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where drag-resistant microbes can emerge and spread. Thus, virtually all common 
infectious bacteria are becoming, or have already become, resistant to one or more 
groups of antibiotics. Such resistance now reaches all classes of antibiotics currently 
in use, including: p-lactams, fluoroquinolones, aminoglycosides, macrolide peptides, 
5 chloramphenicol, tetracyclines, rifampicin, folate inhibitors, glycopeptides, and 
mupirocin. 

Over the last 45 years bacteria have adapted genetically to avoid the 
destruction/alteration of the essential pathways that these chemotherapeutic agents 
target. Antibiotic resistant bacterial strains are now emerging at a higher rate than the 

10 rate at which new antibiotics are being developed. The consequence of this dilemma 
has been a dramatic increase in the cost of treating infections what would otherwise 
easily succumb to routine antibiotic therapy. Furthermore, and perhaps most 
importantly, the emergence of multiple drug resistant pathogenic bacteria has led to a 
significant increase in morbidity and mortality, particularly in institutional settings. 

15 Most major pharmaceutical companies have on-going drug discovery 

programs for novel anti-microbials. These are based on screens for small molecule 
inhibitors (natural products, bacterial culture media, libraries of small molecules, 
combinatorial chemistry) of crucial metabolic pathways of the micro-organism of 
interest (e.g., bacteria, fungi, parasites, worms). The screening process is largely for 

20 cytotoxic compounds and in most cases is not based on a known mechanism of action 
of the compounds. Pharmaceutical companies have large programs in this area. 
Classical drug screening programs are being exhausted and many of these 
pharmaceutical companies are looking towards rational drug design programs. 

Several small to mid-size biotechnology companies as well as large 

25 pharmaceutical companies have developed systematic high-throughput sequencing 
programs to decipher the genetic code of specific micro-organisms of interest. The 
goal is to identify, through sequencing, unique biochemical pathways or intermediates 
that are unique to the microorganism. Knowledge of this may, in turn, form the 
rationale for a drug discovery program based on the mechanism of action of the 

30 identified enzymes/proteins. Genome Therapeutics Corp., The Institute for Genome 
Research, Human Genome Sciences Inc., and other companies have such sequencing 
programs in place. However, one of the most critical steps in this approach is the 
ascertainment that the identified proteins and biochemical pathways are 1) non- 
redundant and essential for bacterial survival, and 2) constitute suitable and accessible 

35 targets for drug discovery. 
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SUMMARY OF THE INVENTION 

While animals such as humans are, on occasion, infected by pathogenic 
bacteria, bacteria also have natural enemies. A number of host-specific viruses, 
5 known as bacteriophages or phages, infect and kill bacteria in the natural 
environment. Such bacteriophages generally have small compact genomes and 
bacteria are their exclusive hosts. Many known bacteria are host to a large number of 
bacteriophages that have been described in the literature. During the 1940's - 1960's, 
phage biology was an area of active research. As a testimony to this, the study of 

10 phages which infect and inhibit the enteric bacterium Escherichia coli (E. coli) 
contributed much to the early understanding of molecular biology and virology. 

This invention utilizes the observation that bacteriophages successfully infect 
and inhibit or kill host bacteria, targeting a variety of normal host metabolic and 
physiological traits, some of which are shared by all bacteria, pathogenic and 

15 nonpathogenic alike. The term "pathogenic" as used herein denotes a contribution to 
or implication in disease or a morbid state of an infected organism. The invention 
thus involves identifying and elucidating the molecular mechanisms by which phages 
interfere with host bacterial metabolism, an objective being to provide novel targets 
for drug design. Whether the phage blocks bacterial RNA transcription or translation, 

20 or attacks other important metabolic pathways, such as cell wall assembly or 

membrane integrity, the basic blueprint for a phage's bacteria-inhibiting ability is 
encoded in its genome and can be unlocked using bioinformatics, functional 
genomics, and proteomics. By these means, the invention utilizes sequence 
information from the genomics of bacteriophage to identify novel antimicrobials that 

25 can be further used to actively and/or prophylactically treat bacterial infection. 

Two important components of the invention thus are: i) the identification of 
bacteria-inhibiting phage open reading frames ("ORF's) and corresponding products 
that can be used to develop antibiotics based on amino acid sequence and secondary 
structural characteristics of the ORF products, and ii) the use of bacteriophages to map 

30 out essential bacterial target genes and homologs, which can in turn lead to the 
development of suitable anti-microbial agents. These two avenues represent new and 
general methods for developing novel antimicrobials. 

The invention thus concerns the identification of bacteriophage ORFs that 
supply bacteria-inhibiting functions. In this regard, use of the terms "inhibit", 

35 "inhibition", "inhibitory", and "inhibitor" all refer to a function of reducing a 
biological activity or function. Such reduction in activity or function can, for 
example, be in connection with a cellular component, e.g., an enzyme, or in 
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connection with a cellular process, e.g., synthesis of a particular protein, or in 
connection with an overall process of a cell, e.g., cell growth. In reference to bacterial 
cell growth, for example, an inhibitory effect (i.e., a bacteria-inhibiting effect) may be 
bacteriocidal (killing of bacterial cells) or bacteriostatic (i.e., stopping or at least 
5 slowing bacterial cell growth). The latter slows or prevents cell growth such that 
fewer cells of the strain are produced relative to uninhibited cells over a given period 
of time. From a molecular standpoint, such inhibition may equate with a reduction in 
the level of, or elimination of, the transcription and/or translation of a specific 
bacterial target(s), or reduction or elimination of activity of a particular target 
10 biomolecule. 

It is particularly advantageous to evaluate a plurality of different phage ORFs 
for inhibitory activity which may be from one, but is preferably from a plurality of 
q different phage. For example, evaluating ORFs from a number of different phage of 

^ the same bacterial host provides at least two advantages. One is that the multiple 

J 15 phages will provide identification of a variety of different targets. Second, it is likely 
~p that multiple phage will utilize the same cellular target, 

si- As used herein, the terms "bacteriophage" and "phage" are used 

fll interchangeably to refer to a virus which can infect a bacterial strain or a number of 

f different bacterial strains. 

f|| 20 In the context of this invention, the term "bacteriophage ORF" or ""phage 

y ORF" or similar term refers to a nucleotide sequence in or from a bacteriophage. In 

connection with a particular ORF, the terms refer an open reading frame which has at 
ll least 95% sequence identity, preferably at least 97% sequence identity, more 

preferably at least 98% sequence identity with an ORF from the particular phage 
25 identified herein (e.g., with an ORF as identified herein) or to a nucleic acid sequence 
which has the specified sequence identify percentage with such an ORF sequence. 

A first aspect of the invention thus provides a method for identifying a 
bacteriophage nucleic acid coding region encoding a product active on an essential 
bacterial target by identifying a nucleic acid sequence encoding a gene product which 
30 provides a bacteria-inhibiting function when the bacteriophage infects a host 
bacterium, preferably one that is an animal or plant pathogen, more preferably a bird 
or mammalian pathogen, and most preferably a human pathogen. The bacteriophage 
is an uncharacterized bacteriophage. Thus, the method excludes, for example, phage 
X, (j)xl74, ml3 and other £.co//-specific bacteriophage that have been studied with 
35 respect to gene number and/or function. It also excludes, for example, the nucleic 
acid coding regions described in Tables 13-14, and in preferred embodiments, 
excludes the phage in which those regions are naturally located. In preferred 
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embodiments of this and the other aspects of the present invention, the phage is 
Staphylococcus aureus phage 77, 3 A, or 96. 

In connection with bacteriophage, the term "uncharacterized" means that a 
certain bacteriophage's genome has not yet been fully identified such that the genes 
5 having function involved in inhibiting host cells have not been identified. In 
particular, phage for which the description of genomic or protein sequence was first 
provided herein are uncharacterized. Phage sequences for which host bacteria- 
inhibiting functions have been identified prior to the filing of the present application 
(or alternatively prior to the present invention) are specifically excluded from the 
10 aspects involving utilization of sequences from uncharacterized bacteriophage, except 
that aspects may involve a plurality of phage where one or more of those phage are 
uncharacterized and one or more others have been characterized to some extent. A 
O number of different bacteria-inhibiting phage ORFs are indicated in Tables 12-14. 

■|f The phage ORFs or sequences identified therein are not within the term 

III 15 "uncharacterized; alternatively, in preferred embodiments the phage containing those 
f: ORFs are excluded from this term. Further, any additional phage ORFs (or 

]r\ alternatively the phage which contain those ORFs) which have previously been 

111 described in the art as bacteria-inhibiting ORFs are expressly excluded; those ORFs or 

L phage are known to those skilled in the art and the exclusion can be made express by 

\%i 20 specifically naming such ORFs or phage as needed (likewise for uncharacterized 
^ targets as described below). For the sake of brevity, such a listing is not expressly 

ijhj presented, as such information is readily available to those skilled in the art. 

^0 Stating that an agent or compound is "active on" a particular cellular target, 

such as the product of a particular gene, means that the target is an important part of a 
25 cellular pathway which includes that target and that the agent acts on that pathway. 
Thus, in some cases the agent may act on a component upstream or downstream of the 

stated target, including on a regulator of that pathway or a component of that pathway. 
By "essential", in connection with a gene or gene product, is meant that the host 

cannot survive without, or is significantly growth compromised, in the absence 

30 depletion, or alteration of functional product. An "essential gene" is thus one that 

encodes a product that is beneficial, or preferably necessary, for cellular growth in 

vitro in a medium appropriate for growth of a strain having a wild-type allele 

corresponding to the particular gene in question. Therefore, if an essential gene is 

inactivated or inhibited, that cell will grow significantly more slowly, preferably less 

35 than 20%, more preferably less than 10%, most preferably less than 5% of the growth 

rate of the uninhibited wild-type, or not at all, in the growth medium. Preferably, in 
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the absence of activity provided by a product of the gene, the cell will not grow at all 
or will be non-viable, at least under culture conditions similar to the in vivo conditions 
normally encountered by the bacterial cell during an infection. For example, absence 
of the biological activity of certain enzymes involved in bacterial cell wall synthesis 
5 can result in the lysis of cells under normal osmotic conditions, even though 
protoplasts can be maintained under controlled osmotic conditions. In the context of 
the invention, essential genes are generally the preferred targets of antimicrobial 
agents. Essential genes can encode target molecules directly or can encode a product 
involved in the production, modification, or maintenance of a target molecule. 

10 A "target" refers to a biomolecule that can be acted on by an exogenous agent, 

thereby modulating, preferably inhibiting, growth or viability of a cell. In most cases 
'% such a target will be a nucleic acid sequence or molecule, or a polypeptide or protein. 

,p However, other types of biomolecules can also be targets, e.g., membrane lipids and 

"Jf cell wall structural components. 

f|j 15 The term "bacterium" refers to a single bacterial strain, and includes a single 

% cell, and a plurality or population of cells of that strain unless clearly indicated to the 

Til 

contrary. In reference to bacteria or bacteriophage, the term "strain" refers to bacteria 
-f- or phage having a particular genetic content. The genetic content includes genomic 

content as well as recombinant vectors. Thus, for example, two otherwise identical 
111 20 bacterial cells would represent different strains if each contained a vector, e.g., a 
* plasmid, with different phage ORF inserts. 

Preferred embodiments involve expressing at least one recombinant phage 
ORF(s) in a bacterial host followed by inhibition analysis of that host. Inhibition 
following expression of the phage ORF is indicative that the product of the ORF is 
25 active on an essential bacterial target. Such evaluation can be carried out in a variety 
of different formats, such as on a support matrix such as a solidified medium in a petri 
dish, or in liquid culture. Preferably a plurality of phage ORFs are expressed in at 
least one bacterium. The plurality of phage ORFs can be from one or a plurality of 
phage. With respect to a single phage or at least one phage in a plurality of phages, 
30 the plurality of expressed ORFs preferably represents at least 10%, more preferably at 
least 20%, 40%, or 60%, still more preferably at least 80% or 90%, and most 
preferably at least 95% of the ORFs in the phage genome. Preferably, for a plurality 
of phage, the plurality of expressed ORFs preferably represents at least 10%, more 
preferably at least 20%, 40%, or 60%, still more preferably at least 80% or 90%, and 
35 most preferably at least 95% of the ORFs in the phage genome of each phage. The 
plurality of phage ORFs can be expressed in a single bacterium, or in a plurality of 
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bacteria where one ORF is expressed in each bacterium, or in a plurality of bacteria 
where a plurality of ORFs are expressed in at least one or in all of the plurality of 
bacteria, or combinations of these. 

In embodiments of the above aspect (as well as in other aspects herein) in 
which a plurality of phage are utilized, a plurality of phage have the same bacterial 
host species; have different bacterial host species; or both. The plurality of phage 
includes at least two different phage, preferably at least 3,4,5,6,8,10,15,20, or more 
different phage. Indeed, more preferably, the plurality of phage will include 50, 75, 
100, or more phage. As described herein, the larger number of phage is useful to 
provide additional target and target evaluation information useful in developing 
antibacterial agents, for example, by providing identification of a larger range of 
bacterial targets, and/or providing further indication of the suitability of a particular 
target (for example, utilization of a target by a number of different unrelated phage 
can suggest that the target is particularly stable and accessible and effective) and/or 
can indicate alternate sites on a target which interact with different inhibitors. 

Further embodiments involve confirmation of the inhibitor function of the 
phage ORF, such as by utilizing or incorporating a control(s) designed to confirm the 
inhibitory nature of the ORF(s) being evaluated. The control can, for example, be 
provided by expression of an inactive or partially inactive form of the ORF or ORF 
product, and/or by the absence of expression of the ORF or ORF product in the same 
or a closely comparable bacterial strain as that used for expression of the test ORF. 
The reduced level of activity or the absence of active ORF product in the control will 
thus not provide the inhibition provided by a corresponding inhibitory ORF, or will 
provide a distinguishably lower level of inhibition. An inactivated or partially 
inactivated control has a mutation(s), e.g., in the coding region or in flanking 
regulatory elements, that reduce(s) or eliminate(s) the normal function of the ORF. 
Thus, the inhibition of a bacterium following expression of a phage ORF is 
determined by comparison with the effects of expression of an inactivated ORF or the 
response of the bacteria in the absence of expression in the same or similar type 
bacterium. Such determination of inhibition of the bacterium following expression of 
the ORF is indicative of a bacteria-inhibiting function. These manipulations are 
routinely understood and accomplished by those of skill in the art using standard 
techniques. In embodiments utilizing absence of expression of the ORF, the bacteria 
can, for example, contain an empty vector or a vector which allows expression of an 
unrelated sequence which is preferably non-inhibitory. Alternatively, the bacteria 
may have no vector at all. Combinations of such controls or other controls may also 
be utilized as recognized by those skilled in the art. 
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In embodiments involving expression of a phage ORF in a bacterial strain, in 
preferred embodiments that expression is inducible. By "inducible" is meant that 
expression is absent or occurs at a low level until the occurrence of an appropriate 
environmental stimulus provides otherwise. For the present invention such induction 
5 is preferably controlled by an artificial environmental change, such as by contacting a 
bacterial strain population with an inducing compound (i.e., an inducer). However, 
induction could also occur, for example, in response to build-up of a compound 
produced by the bacteria in the bacterial culture, e.g., in the medium. As uncontrolled 
or constitutive expression of inhibitory ORFs can severely compromise bacteria to the 

10 point of eradication, such expression is therefore undesirable in many cases because it 
would prevent effective evaluation of the strain and inhibitor being studied. For 
example, such uncontrolled expression could prevent any growth of the strain 
following insertion of a recombinant ORF, thus preventing determination of effective 
transfection or transformation. A controlled or inducible expression is therefore 

15 advantageous and is generally provided through the provision of suitable regulatory 
elements, e.g., promoter/operator sequences that can be conveniently transcriptionally 
linked to a coding sequence to be evaluated. In most cases, the vector will also 
contain sequences suitable for efficient replication of the vector in the same or 
different host cells and/or sequences allowing selection of cells containing the vector, 

20 i.e., "selectable markers." Further, preferred vectors include convenient primer 
sequences flanking the cloning region from which PCR and/or sequencing may be 
performed. 

As knowledge of the nucleotide sequence of phage ORFs is useful, e.g., for 
assisting in the identification of phage proteins active against essential bacterial host 

25 targets, preferred embodiments involve the sequencing of at least a portion of the 
phage genome in combination with the above methods. This can be done either before 
or after or independent of expression and inhibition of the ORF in the bacteria, and 
provides information on the nature and characteristics of the ORF. Such a portion is 
preferably at least 10%, 20%, 40%, 80%, 90%, or 100% of the phage genome. For 

30 embodiments in which a plurality of phage are utilized, preferably each phage is 
sequenced to an extent as just specified. 

Such sequencing is preferably accompanied by computer sequence analysis to 
define and evaluate ORF(s), ORF products, structural motifs or functional properties 
of ORF products, and/or their genetic control elements. Thus, certain embodiments 

35 incorporate computer sequence analyses or nucleic acid and/or amino acid sequences. 
Further, existing data banks can provide phage sequence and product information 
which can be utilized for analysis and identification of ORFs in the sequence. 
Computer analysis may further employ known homologous sequences from other 
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species that suggest or indicate conserved underlying biochemical function(s) for the 
inhibitory or potentially inhibitory ORF sequence(s) being evaluated. This can 
include the sequences of signature motifs of identified classes of inhibitors. 

In the context of the phage nucleic acid sequences, e.g., gene sequences, of this 
5 invention, the terms "homolog" and "homologous" denote nucleotide sequences from 
different bacteria or phage strains or species or from other types of organisms that 
have significantly related nucleotide sequences, and consequently significantly related 
encoded gene products, preferably having related function. Homologous gene 
sequences or coding sequences have at least 70% sequence identity (as defined by the 

10 maximal base match in a computer-generated alignment of two or more nucleic acid 
sequences) over at least one sequence window of 48 nucleotides, more preferably at 
least 80 or 85%, still more preferably at least 90%, and most preferably at least 95%. 
The polypeptide products of homologous genes have at least 35% amino acid 
sequence identity over at least one sequence window of 18 amino acid residues, more 

15 preferably at least 40%, still more preferably at least 50% or 60%, and most 
preferably at least 70%, 80%, or 90%. Preferably, the homologous gene product is 
also a functional homolog, meaning that the homolog will functionally complement 
one or more biological activities of the product being compared. For nucleotide or 
amino acid sequence comparisons where a homology is defined by a % sequence 

20 identity, the percentage is determined using BLAST programs ( with default 
parameters (Altschul et al., 1997, "Gapped BLAST and PSI-BLAST: a new 
generation of protein database search programs, Nucleic Acid Res. 25:3389-3402). 
Any of a variety of algorithms known in the art which provide comparable results can 
also be used, preferably using default parameters. Performance characteristics for 

25 three different algorithms in homology searching is described in Salamov et al., 1999, 
"Combining sensitive database searches with multiple intermediates to detect distant 
homologues." Protein Eng. 12:95-100. Another exemplary program package is the 
GCG™ package from the University of Wisconsin. 

Homologs may also or in addition be characterized by the ability of two 

30 complementary nucleic acid strands to hybridize to each other under appropriately 
stringent conditions. Hybridizations are typically and preferably conducted with 
probe-length nucleic acid molecules, preferably 20-100 nucleotides in length. Those 
skilled in the art understand how to estimate and adjust the stringency of hybridization 
conditions such that sequences having at least a desired level of complementarity will 

35 stably hybridize, while those having lower complementarity will not. For examples of 
hybridization conditions and parameters, see, e.g.,. Maniatis, T. et al. (1989) 
Molecular Cloning: A Laboratory Manual Cold Spring Harbor University Press, Cold 
Spring, N.Y.; Ausubel, F.M. et al. (1994) Current Protocols in Molecular Biology . 
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John Wiley & Sons, Secaucus, N J. Homologs and homologous gene sequences may 
thus be identified using any nucleic acid sequence of interest, including the phage 
ORFs and bacterial target genes of the present invention, 

A typical hybridization, for example, utilizes, besides the labeled probe of 
interest, a salt solution such as 6xSSC (NaCl and Sodium Citrate base) to stabilize 
nucleic acid strand interaction, a mild detergent such as 0.5% SDS, together with 
other typical additives such as Denhardt's solution and salmon sperm DNA. The 
solution is added to the immobilized sequence to be probed and incubated at suitable 
temperatures to preferably permit specific binding while minimizing nonspecific 
binding. The temperature of the incubations and ensuing washes is critical to the 
success and clarity of the hybridization. Stringent conditions employ relatively higher 
temperatures, lower salt concentrations, and/or more detergent than do non-stringent 
conditions. Hybridization temperatures also depend on the length, complementarity 
level, and nature (ie, "GC content") of the sequences to be tested. Typical stringent 
hybridizations and washes are conducted at temperatures of at least 40°C, while lower 
stringency hybridizations and washes are typically conducted at 37°C down to room 
temperature (~25°C). One of skill in the art is aware that these conditions may vary 
according to the parameters indicated above, and that certain additives such as 
formamide and dextran sulphate may also be added to affect the conditions. 

By "stringent hybridization conditions" is meant hybridization conditions at 
least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM 
NaH 2 P0 4 , pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 5X 
Denhart's solution at 42°C overnight; washing with 2X SSC, 0.1% SDS at 45°C; and 
washing with 0.2X SSC, 0.1% SDS at 45°C. 

In sequence comparison analyses, an ORF, or motif, or set of motifs in a 
bacteriophage sequence can be compared to known inhibitor sequences, e.g., 
homologous sequences encoding homologous inhibitors of bacterial function. 
Likewise, the analysis can include comparison with the structure of essential bacterial 
gene products, as structural similarities can be indicative of similar or replacement 
biological function. Such analysis can include the identification of a signature, or 
characteristic motif(s) of an inhibitor or inhibitor class. 

Also, the identification of structural motifs in an encoded product, based on 
nucleotide or amino acid sequence analysis, can be used to infer a biochemical 
function for the product. A database containing identified structural motifs in a large 
number of sequences is available for identification of motifs in phage sequences. The 
database is PROSITE, which is available at www.expasy.ch/cgi-bin/scanprosite. The 
identification of motifs can, for example, include the identification of signature motifs 
for a class or classes of inhibitory proteins. Other such databases may also be used. 
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In aspects and preferred embodiments described herein, in which a bacterium 
or host bacterium is specified, the bacterium or host bacterium is preferably selected 
from a pathogenic bacterial species, for example, one selected from Table 1. 
Preferably, an animal or plant pathogen is used. For animals, preferably the bacterium 
5 is a bird or mammalian pathogen, still more preferably a human pathogen. 

In aspects and preferred embodiments involving a bacteriophage or sequences 
from a bacteriophage, one or more bacteriophage are preferably selected from those 
listed in Table 1 in the Detailed Description below. Those exemplary bacteriophge 
are readily obtained from the indicated sources. 
10 In some cases, it is advantageous to utilize phage with non-pathogenic host 

bacteria. The genome, structural motif, ORF, homolog, and other analyses described 
herein can be performed on such phage and bacteria. Such analysis provides useful 
p information and compositions. The results of such analyses can also be utilized in 

|f aspects of the present invention to identify homologous ORFs, especially inhibitor 

Iji 1 5 ORFs in phage with pathogenic bacterial hosts. Similarly, identification of a target in 
;P a non-pathogenic host can be used to identify homologous sequences and targets in 

I fl pathogenic bacteria, especially in genetically closely related bacteria. Those skilled in 

f!J the art are familiar with bacterial genetic relationships and with how to determine 

? « relatedness based on levels of genomic identity or other measures of nucleotide 

f|! 20 sequence and/or amino acid sequence similarity, and/or other physical and culture 
^ characteristics such as morphology, nutritional requirements, or minimal media to 

S support growth. 

© Also in preferred embodiments, an embodiments of this aspect is combined 

with an embodiment of the following aspect. 

25 A related aspect of the invention provides methods for identifying a target for 

antibacterial agents by identifying the bacterial target(s) of at least one 
uncharacterized or untargeted inhibitor protein or RNA from a bacteriophage. Such 
identification allows the development of antibacterial agents active on such targets. 
Preferred embodiments for identifying such targets involve the identification of 

30 binding of target and phage ORF products to one another. The phage ORF products 
may be subportions of a larger ORF product that also binds the host target. In 
preferred embodiments, the phage protein or RNA is from an uncharacterized 
bacteriophage in Table 1. This aspect preferably includes the identification of a 
plurality of such targets in one or a plurality of different bacteria, preferably in one or 

35 a plurality of bacteria listed in Table 1 . 

In preferred embodiments of this aspect and other aspects of this invention 
involving particular phage ORFs or phage sequences, the ORF is Staphylococcus 
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aureus phage 77 ORF 17, 19, 43, 102, 104, or 182 as identified in U.S. application 
09/407,804. 

As indicated for the above aspect, preferably the method involves the use of a 
plurality of different phage, and thus a plurality of different phage inhibitors and/or 
inhibitor ORFs. 

In addition to uncharacteized phage ORF products, it is also useful to identify 
the targets of phage ORF products which are known to be inhibitors of host bacteria, 
but where the target has not been identified. Thus, such inhibitors can likewise be 
utilized as "untargeted" inhibitor phage ORFs and ORF products, e.g., proteins or 
RNAs. 

In the context of inhibitor proteins or RNAs from a phage, the term 
"uncharacterized" means that a bacteria-inhibiting function for the protein has not 
previously been identified. Preferably, but not necessarily, the sequence of the protein 
or the corresponding coding region or ORF was not described in the art before the 
filing of the present application for patent (or alternatively prior to the present 
invention). Thus, this term specifically excludes any bacteria-inhibiting phage protein 
and its associated bacterial target which has been identified as inhibitory before the 
present invention or alternatively before the filing of the present application, for 
example those identified in Tables 12-14 or otherwise identified herein. For example, 
from E. coli, phage T7 genes 0.7 and 2.0 target the host RNA polymerase, phage T4 
gp55/gp33 alter the specificity of host RNA polymerase. The T4 regB gene product 
also targets the host translation apparatus. As with the uncharacterized bacteriophage 
ORFs or bacteriophage above, for such identified proteins, the sequences encoding 
those proteins are excluded from the uncharacterized inhibitor proteins. 

The term "fragment" refers to a portion of a larger molecule or assembly. For 
proteins, the term "fragment" refers to a molecule which includes at least 5 contiguous 
amino acids from the reference polypeptide or protein, preferably at least 8, 10, 12, 
15, 20, 30, 50 or more contiguous amino acids. In connection with oligo- or 
polynucleotides, the term "fragment" refers to a molecule which includes at least 15 
contiguous nucleotides from a reference polynucleotide, preferably at least 24, 30, 36, 
45, 60, 90, 150, or more contiguous nucleotides. 

Preferred embodiments involve identification of binding that include methods 
for distinguishing bound molecules, for example, affinity chromatography, 
immunoprecipitation, crosslinking, and/or genetic screen methods that permit 
proteinrprotein interactions to be monitored. One of skill in the art is familiar with 
these techniques and common materials utilized (see, e.g., Coligan, J. et al. (eds.) 
(1995) Current Protocols in Protein Science. John Wiley & Sons, Secaucus, N.J.). 
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Genetic screening for the identification of protein:protein interactions typically 
involves the co-introduction of both a chimeric bait nucleic acid sequence (here, the 
phage ORF to be tested) and a chimeric target nucleic acid sequence that, when co- 
expressed and having affinity for one another in a host cell, stimulate reporter gene 
5 expression to indicate the relationship. A "positive" can thus suggest a potential 
inhibitory effect in bacteria. This is discussed in further detail in the Detailed 
Description section below. In this way, new bacterial targets can be identified that are 
inhibited by specific phage ORF products or derivatives, fragments, mimetics, or 
other molecules. 

10 Other embodiments involve the identification and/or utilization of mutant 

targets by virtue of their host's relatively unresponsive nature in the presence of 
expression of ORFs previously identified as inhibitory to the non-mutant or wild-type 
strain. Such mutants have the effect of protecting the host from an inhibition that 
would otherwise occur and indirectly allow identification of the precise responsible 

15 target for follow-up studies and anti-microbial development. In certain embodiments, 
rescue from inhibition occurs under conditions in which a bacterial target or mutant 
target is highly expressed. This is performed, for example, through coupling of the 
sequence with regulatory element promoters, e.g., as known in the art, which regulate 
expression at levels higher than wild-type, e.g., at a level sufficiently higher that the 

20 inhibitor can be competitively bound to the highly expressed target such that the 
bacterium is detectably less inhibited. 

Identification of the bacterial target can involve identification of a phage- 
specific site of action. This can involve a newly identified target, or a target where the 
phage site of action differs from the site of action of a previously known antibacterial 

25 agent or inhibitor. For example, phage T7 genes 0.7 and 2.0 target the host RNA 
polymerase, which is also the cellular target for the antibacterial agent, rifampin. To 
the extent that a phage product is found to act at a different site than previously 
described inhibitors, aspects of the present invention can utilize those new, phage- 
specific sites for identification and use of new agents. The site of action can be 

30 identified by techniques well-known to those skilled in the art, for example, by 
mutational analysis, binding competition analysis, and/or other appropriate 
techniques. 

Once a bacterial host target protein or nucleic acid or mutant target sequence 
has been identified and/or isolated, it too can be conveniently sequenced, sequence 
35 analyzed (e.g. , by computer), and the underlying gene(s), and corresponding translated 
product(s) further characterized. Preferred embodiments include such analysis and 
identification. Preferably such a target has not previously been identified as an 
appropriate target for antibacterial action. 
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Certain embodiments include the identification of at least one inhibitory phage 
ORF or ORF product, e.g., as described for the above aspect, and thus are a 
combination of the two aspects. 

Additionally, the invention provides methods for identifying targets for 
5 antibacterial agents by identifying homologs of a Enterococcus sp. target of a 
bacteriophage inhibitory ORF product. Such homologs may be utilized in the various 
aspects and embodiments described herein as describded for the host Enterococcus sp. 
for bacteriophage 182. 

Other aspects of the invention provide isolated, purified, or enriched specific 

10 phage nucleic acid and amino acid sequences, subsequences, and homologs thereof for 
phage selected from uncharacterized phage listed in Table 1, preferably from 
bacteriophage 77, 3A, 96. For example, such sequences do not include sequences 
identified in any of Tables 11-14. Such nucleotide sequences are at least 15 
nucleotides in length, preferably at least 18, 21, 24, or 27 nucleotides in length, more 

15 preferably at least 30, 50, or 90 nucleotides in length. In certain embodiments, longer 
nucleic acids are preferred, for example those of at least 120, 150, 200, 300, 600, 900 
or more nucleotides. Such sequences can, for example, be amplification 
oligonucleotides (e.g., PCR primers), oligonucleotide probes, sequences encoding a 
portion or all of a phage-encoded protein, or a fragment or all of a phage-encoded 

20 protein. In preferred embodiments, the nucleic acid sequence contains a sequence 
which is within a length range with a lower length as specified above, and an upper 
length limit which is no more than 50, 60, 70, 80, or 90% of the length of the 
corresponding full-length ORF. The upper length limit can also be expressed in terms 
of the number of base pairs of the ORF (coding region). In preferred embodiments, 

25 the nucleic acid sequence is from Staphylococcus aureus phage 77 ORF 17, 19, 43, 
102, 104, or 182 as identified in U.S. application 09/407,804. 

As it is recognized that alternate codons will encode the same amino acid for 
most amino acids due to the degeneracy of the genetic code, the sequences of this 
aspect includes nucleic acid sequences utilizing such alternate codon usage for one or 

30 more codons of a coding sequence. For example, all four nucleic acid sequences 
GCT, GCC, GCA, and GCG encode the amino acid, alanine. Therefore, if for an 
amino acid there exists an average of three codons, a polypeptide of 100 amino acids 
in length will, on average, be encoded by 3 100 , or 5 x 10 47 , nucleic acid sequences. 
Thus, a nucleic acid sequence can be modified (e.g., a nucleic acid sequence from a 

35 phage as specified above) to form a second nucleic acid sequence encoding the same 
polypeptide as encoded by the first nucleic acid sequence using routine procedures 
and without undue experimentation. Thus, all possible nucleic acid sequences that 
encode the specified amino acid sequences are also fully described herein, as if all 
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were written out in full, taking into account the codon usage, especially that preferred 
in the host bacterium. The alternate codon descriptions are available in common 
texbooks, for example, Stryer, BIOCHEMISTRY 3 rd ed., and Lehninger, 
BIOCHEMISTRY 3 rd ed. Codon preference tables for various types of organisms are 
5 available in the literature. Sequences with alternate codons at one or more sites can 
also be utilized in the computer-related aspects and embodiments herein. Because of 
the number of sequence variations involving alternate codon usage, for the sake of 
brevity, individual sequences are not separately listed herein. Instead the alternate 
sequences are described by reference to the natural sequence with replacement of one 
1 0 or more (up to all) of the degenerate codons with alternate codons from the alternate 
codon table (Table 6), preferably with selection according to preferred codon usage 
for the normal host organism or a host organism in which a sequence is intended to be 
Q expressed. Those skilled in the art also understand how to alter the alternate codons to 

■|f be used for expression in organisms where certain codons code differently than shown 

ik 15 in the "universal" codon table. 

if: For amino acid sequences or polypeptides, sequences contain at least 5 peptide- 

- h linked amino acid residues, and preferably at least 6, 7, 10, 15, 20, 30, or 40, amino 

111 acids having identical amino acid sequence as the same number of contiguous amino 

r .. acid residues in a particular phage ORF product. In some cases longer sequences may 

III 20 be preferred, for example, those of at least 50, 60, 70, 80, or 100 amino acids in 

length. In preferred embodiments, the amino acid sequence contains a sequence which 
k B is within a length range with a lower length as specified above, and an upper length 

limit which is no more than 50, 60, 70, 80, or 90% of the length of the corresponding 
full-length ORF product. The upper length limit can also be expressed in terms of the 
25 number of amino acid residues of the ORF product. In preferred embodiments, the 
amino acid sequence or polypeptide has bacteria-inhibiting function when expressed 
or otherwise present in a bacterial cell that is a host for the bacteriophage from which 
the sequence was derived. 

By "isolated" in reference to a nucleic acid is meant that a naturally occurring 
30 sequence has been removed from its normal cellular (e.g., chromosomal) environment 
or is synthesized in a non-natural environment (e.g., artificially synthesized). Thus, 
the sequence may be in a cell-free solution or placed in a different cellular 
environment. The term does not imply that the sequence is the only nucleotide chain 
present, but that it is essentially free (about 90-95% pure at least) of non-nucleotide 
35 material naturally associated with it, and thus is distinguished from isolated 
chromosomes. 

The term "enriched" means that the specific DNA or RNA sequence 
constitutes a significantly higher fraction (2-5 fold) of the total DNA or RNA present 
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in the cells or solution of interest than in normal or diseased cells or in cells from 
which the sequence was originally taken. This could be caused by a person by 
preferential reduction in the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA or RNA sequence, or by a 
5 combination of the two. However, it should be noted that enriched does not imply 
that there are no other DNA or RNA sequences present, just that the relative amount 
of the sequence of interest has been significantly increased. 

The term "significant" is used to indicate that the level of increase is useful to 
the person making such an increase and an increase relative to other nucleic acids of 

10 about at least 2-fold, more preferably at least 5- to 10-fold or even more. The term 
also does not imply that there is no DNA or RNA from other sources. The other 
source DNA may, for example, comprise DNA from a yeast or bacterial genome, or a 
cloning vector such as pUC19. This term distinguishes from naturally occurring 
events, such as viral infection, or tumor type growths, in which the level of one 

1 5 mRNA may be naturally increased relative to other species of mRNA. That is, the 
term is meant to cover only those situations in which a person has intervened to 
elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that a nucleotide sequence be in 
purified form. The term "purified" in reference to nucleic acid does not require 

20 absolute purity (such as a homogeneous preparation). Instead, it represents an 

indication that the sequence is relatively more pure than in the natural environment 
(compared to the natural level, this level should be at least 2-5 fold greater, e.g., in 
terms of mg/mL). Individual clones isolated from a cDNA library may be purified to 
electrophoretic homogeneity. The claimed DNA molecules obtained from these 

25 clones could be obtained directly from total DNA or from total RNA. The cDNA 

clones are not naturally occurring, but rather are preferably obtained via manipulation 
of a partially purified naturally occurring substance (messenger RNA). The 
construction of a cDNA library from mRNA involves the creation of a synthetic 
substance (cDNA) and pure individual cDNA clones can be isolated from the 

30 synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the 
process which includes the construction of a cDNA library from mRNA and isolation 
of distinct cDNA clones yields an approximately 10 6 -fold purification of the native 
message. Thus, purification of at least one order of magnitude, preferably two or 
three orders, and more preferably four or five orders of magnitude is expressly 

35 contemplated. 

The terms "isolated", "enriched", and "purified" as used with respect to 
nucleic acids, above, may similarly be used to denote the relative purity and 
abundance of polypeptides ( multimers of amino acids joined one to another by cc- 
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carboxyl:a-amino group (peptide) bonds). These, too, may be stored in, grown in, 
screened in, and selected from libraries using biochemical techniques familiar in the 
art. Such polypeptides may be natural, synthetic or chimeric and may be extracted 
using any of a variety of methods, such as antibody immunoprecipitation, other 
5 "tagging" techniques, conventional chromatography and/or electrophoretic methods. 
Some of the above utilize the corresponding nucleic acid sequence. 

As indicated above, aspects and embodiments of the invention are not limited 
to entire genes and proteins. The invention also provides and utilizes fragments and 
portions thereof, preferably those which are "active" in the inhibitory sense described 

1 0 above. Such peptides or oligopeptides and oligo or polynucleotides have preferred 
lengths as specified above for nucleic acid and amino acid sequences from phage; 
corresponding recombinant constructs can be made to express the encoded same. 
Also included are homologous sequences and fragments thereof. 

The nucleotide and amino acid sequences identified herein are believed to be 

15 correct, however, certain sequences may contain a small percentage of errors, e.g., 1- 
5%. In the event that any of the sequences have errors, the corrected sequences can be 
readily provided by one skilled in the art using routine methods. For example, the 
nucleotide sequences can be confirmed or corrected by obtaining and culturing the 
relevant phage, and purifying phage genomic nucleic acids. A region or regions of 

20 interest can be amplified, e.g. , by PCR from the appropriate genomic template, using 
primers based on the described sequence. The amplified regions can then be 
sequenced using any of the available methods (e.g., a dideoxy termination method). 
This can be done redundantly to provide the corrected sequence or to confirm that the 
described sequence is correct. Alternatively, a particular sequence or sequences can 

25 be identified and isolated as an insert or inserts in a phage genomic library and 

isolated, amplified, and sequenced by standard methods. Confirmation or correction 
of a nucleotide sequence for a phage gene provides an amino acid sequence of the 
encoded product by merely reading off the amino acid sequence according to the 
normal codon relationships and/or expressed in a standard expression system and the 

30 polypeptide product sequenced by standard techniques. The sequences described 
herein thus provide unique identification of the corresponding genes and other 
sequences, allowing those sequences to be used in the various aspects of the present 
invention. 

In other aspects the invention provides recombinant vectors and cells 
35 harboring at least one of the phage ORFs or portion thereof, or bacterial target 

sequences described herein. As understood by those skilled in the art, vectors may be 
provided in different forms, including, for example, plasmids, cosmids, and virus- 
based vectors. See, e.g., Maniatis, T. et al. (1989) Molecular Cloning: A Laboratory 
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Manual Cold Spring Harbor University Press, Cold Spring, N.Y.; See also, Ausubel, 
F.M. et al. (eds.) (1994) Current Protocols in Molecular Biology , John Wiley & Sons, 
Secaucus, N. J. 

In preferred embodiments, the vectors will be expression vectors, preferably 
5 shuttle vectors that permit cloning, replication, and expression within bacteria. An 
"expression vector" is one having regulatory nucleotide sequences containing 
transcriptional and translational regulatory information that controls expression of the 
nucleotide sequence in a host cell. Preferably the vector is constructed to allow 
amplification from vector sequences flanking an insert locus. In certain embodiments, 
10 the expression vectors may additionally or alternativley support expression, and/or 
replication in animal, plant and/or yeast cells due to the presence of suitable 
regulatory sequences, e.g., promoters, enhancers, V stabilizing sequences, primer 
q sequences, etc. In preferred embodiments, the promoters are inducible and specific 

•|f for the system in which expression is desired, e.g. , bacteria, animal, plant, or yeast. 

Ill 1 5 The vectors may optionally encode a "tag" sequence or sequences to facilitate protein 
~F purification. Convenient restriction enzyme cloning sites and suitable selective 

l V marker(s) are also optionally included. Such selective markers can be, for example, 

rij antibiotic resistance markers or markers which supply an essential nutritive growth 

f factor to an otherwise deficient mutant host, e.g., tryptophan, histidine, or leucine in 

ry 20 the Yeast Two-Hybrid systems described below. 

H The term "recombinant vector" relates to a single- or double-stranded circular 

^ nucleic acid molecule that can be transfected into cells and replicated within or 

tfl independently of a cell genome. A circular double-stranded nucleic acid molecule can 

be cut and thereby linearized upon treatment with appropriate restriction enzymes. An 
25 assortment of nucleic acid vectors, restriction enzymes, and the knowledge of the 
nucleotide sequences cut by restriction enzymes are readily available to those skilled 
in the art. A nucleic acid molecule encoding a desired product can be inserted into a 
vector by cutting the vector with restriction enzymes and ligating the two pieces 
together. Preferably the vector is an expression vector, e.g., a shuttle expression 
30 vector as described above. 

By " recombinant cell" is meant a cell possessing introduced or engineered 
nucleic acid sequences, e.g., as described above. The sequence may be in the form of 
or part of a vector or may be integrated into the host cell genome. Preferably the cell 
is a bacterial cell. 

35 In another aspect, the invention also provides methods for identifying and/or 

screening compounds "active on" at least one bacterial target of a bacteriophage 
inhibitor protein or RNA. Preferred embodiments involve contacting such a bacterial 
target or targets (e.g., bacterial target proteins) with a test compound, and determining 
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whether the compound binds to or reduces the level of activity of the bacterial target 
(e.g., a bacterial target protein). Preferably this is done either in vivo (i.e., in a cell- 
based assay) or in vitro, e.g., in a cell-free system under approximately physiological 
conditions. 

The compounds that can be used may be large or small, synthetic or natural, 
organic or inorganic, proteinaceous or non-proteinaceous. In preferred embodiments, 
the compound is a peptidomimetic, as described herein, a bacteriophage inhibitor 
protein or fragment or derivative thereof, preferably an "active portion", or a small 
molecule. 

In particular embodiments, the methods include the identification of bacterial 
targets or the site of action of an inhibitor on a bacterial target as described above or 
otherwise described herein. 

In embodiments involving binding assays, preferably binding is to a fragment 
or portion of a bacterial target protein, where the fragment includes less than 90%, 
80%, 70%, 60%, 50%, 40%, or 30% of an intact bacterial target protein. Preferably, 
the at least one bacterial target includes a plurality of different targets of 
bacteriophage inhibitor proteins, preferably a plurality of different targets. The 
plurality of targets can be in or from a plurality of different bacteria, but preferably is 
from a single bacterial species. 

A "method of screening" refers to a method for evaluating a relevant activity 
or property of a large plurality of compounds (e.g., a bacteria-inhibiting activity), 
rather than just one or a few compounds. For example, a method of screening can be 
used to conveniently test at least 100, more preferably at least 1000, still more 
preferably at least 10,000, and most preferably at least 100,000 different compounds, 
or even more. 

In the context of this invention, the term "small molecule" refers to 
compounds having molecular mass of less than 2000 Daltons, preferably less than 
1500, still more preferably less than 1000, and most preferably less than 600 Daltons. 
Preferably but not necessarily, a small molecule is not an oligopeptide. 

In a related aspect or in preferred embodiments, the invention provides a 
method of screening for potential antibacterial agents by determining whether any of a 
plurality of compounds, preferably a plurality of small molecules, is active on at least 
one target of a bacteriophage inhibitor protein or RNA. Preferred embodiments 
include those described for the above aspect, including embodiments which involve 
determining whether one or more test compounds bind to or reduce the level of 
activity of a bacterial target, and embodiments which utilize a plurality of different 
targets as described above. 
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The identification of bacteria-inhibiting phage ORFs and their encoded 
products also provides a method for identifying an active portion of such an encoded 
product. This also provides a method for identifying a potential antibacterial agent by 
identifying such an active portion of a phage ORF or ORF product. In preferred 
5 embodiments, the identification of an active portion involves one or more of 
mutational analysis, deletion analysis, or analysis of fragments of such products. The 
method can also include determination of a 3-dimensional structure of an active 
portion, such as by analysis of crystal diffraction patterns. In further embodiments, 
the method involves constructing or synthesizing a peptidomimetic compound, where 

10 the structure of the peptidomimetic compound corresponds to the structure of the 
active portion. In this context, "corresponds" means that the peptidomimetic 
compound structure has sufficient similarities to the structure of the active portion that 
the peptidomimetic will interact with the same molecule as the phage protein and 
preferably will elicit at least one cellular response in common which relates to the 

1 5 inhibition of the cell by the phage protein. 

The methods for identifying or screening for compounds or agents active on a 
bacterial target of a phage-encoded inhibitor can also involve identification of a 
phage-specific site of action on the target. 

Preferably in the methods for identifying or screening for compounds active 

20 on such a bacterial target, the target is uncharacterized; the target is from an 
uncharacterized bacterium from Table 1; the site of action is a phage-specfic site of 
action. 

Further embodiments include the identification of inhibitor phage ORFs and 
bacterial targets as in aspects above. 

25 An "active portion" as used herein denotes an epitope, a catalytic or regulatory 

domain, or a fragment of a bacteriophage inhibitor protein that is responsible for, or a 
significant factor in, bacterial target inhibition. The active portion preferably may be 
removed from its contiguous sequences and, in isolation, still effect inhibition. 

By "mimetic" is meant a compound structurally and functionally related to a 

30 reference compound that can be natural, synthetic, or chimeric. In terms of the present 
invention, a "peptidomimetic," for example, is a compound that mimics the activity- 
related aspects of the 3-dimensional structure of a peptide or polyeptide in a non- 
peptide compound, for example mimics the structure of a peptide or active portion of 
a phage- or bacterial ORF-encoded polypeptide. 

35 A related aspect provides a method for inhibiting a bacterial cell by contacting 

the bacterial cell with a compound active on a bacterial target of a bacteriophage 
inhibitor protein or RNA, where the target was uncharacterized. In preferred 
embodiments, the compound is such a protein, or a fragment or derivative thereof; a 
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structural mimetic, e.g., a peptidomimetic, of such a protein or fragment; a small 
molecule; the contacting is performed in vitro, the contacting is performed in vivo in 
an infected or at risk organism, e.g., an animal such as a mammal or bird, for example, 
a human, or other mammal described herein; the bacterium is selected from a genus 
5 and/or species listed in Table 1; the bacteriophage inhibitor protein is uncharacterized; 
and the bacteriophage inhibitor protein is from an uncharacterized phage listed in 
Table L 

In the context of targets in this invention, the term "uncharacterized" means 
that the target was not recognized as an appropriate target for an antibacterial agent 
10 prior to the filing of the present application or alternatively prior to the present 
invention. Such lack of recognition can include, for example, situations where the 
target and/or a nucleotide sequence encoding the target were unknown, situations 
p where the target was known, but where it had not been identified as an appropriate 

target or as an essential cellular component, and situations where the target was 
l?l 15 known as essential but had not been recognized as an appropriate target due to a belief 
J that the target would be inaccessible or otherwise that contacting the cell with a 

Ijl compound active on the target in vitro would be ineffective in cellular inhibition, or 

Rl ineffective in treatment of an infection. Methods described herein utilizing bacterial 

y: targets, e.g., for inhibiting bacteria or treating bacterial infections, can also utilize 

Tk} 20 "uncharacterized target sites", meaning that the target has been previously recognized 
as an appropriate target for an antibacterial agent, but where an agent or inhibitor of 
?|i the invention is used which acts at a different site than that at which the previously 

^ utilized antibacterial agent, Le. 9 a phage-specific site. Preferably the phage-specific 

site has different functional characteristics from the previously utilized site. In the 
25 context of targets or target sites, the term "phage-specific" indicates that the target or 
site is utilized by at least one bacteriophage as an inhibitory target and is different 
from previously identified targets or target sites. 

In the context of this invention, the term "bacteriophage inhibitor protein" 
refers to a protein encoded by a bacteriophage nucleic acid sequence which inhibits 
30 bacterial function in a host bacterium. Thus, it is a bacteria-inhibiting phage product. 

In the context of this invention, the phrase "contacting the bacterial cell with a 
compound active on a bacterial target of a bacteriophage inhibitor protein" or 
equivalent phrases refer to contacting with an isolated, purified, or enriched 
compound or a composition including such a compound, but specifically does not rely 
35 on contacting the bacterial cell with an intact phage which encodes the compound. 
Preferably no intact phage are involved in the contacting. 

Related aspects provide methods for prophylactic or therapeutic treatment of a 
bacterial infection by administering to an infected, challenged or at risk organism a 
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therapeutically or prophylactically effective amount of a compound active on a target 

of a bacteriophage inhibitor protein or RNA, or as described for the previous aspect. 

Preferably the bacterium involved in the infection or risk of infection produces the 

identified target of the bacteriophage inhibitor protein or alternatively produces a 
5 homologous target compound. In preferred embodiments, the host organism is a plant 

or animal, preferably a mammal or bird, and more preferably, a human or other 

mammal described herein. Preferred embodiments include, without limitation, those 

as described for the preceding aspect. 

Compounds useful for the methods of inhibiting, methods of treating, and 
10 pharmaceutical compositions can include novel compounds, but can also include 

compounds which had previously been identified for a purpose other than inhibition 

of bacteria. Such compounds can be utilized as described and can be included in 

pharmaceutical compositions. 

In preferred embodiments of this and other aspects of the invention utilizing 
15 bacterial target sequences of a bacteriophage inhibitory ORF product, the target 

sequence is encoded by a Staphylococcus nucleic acid coding sequence, preferably S. 

aureus. Possible target sequences are described herein by reference to sequence 

source sites. 

The amino acid sequence of a polypeptide target is readily provided by 

20 translating the corresponding coding region. For the sake of brevity, the sequences 
are not reproduced herein. For the sake of brevity, the sequences are described by 
reference to the GenBank entries instead of being written out in full herein. In cases 
where the TIGR or GenBank entry for a coding region is not complete, the complete 
sequence can be readily obtained by routine methods, e.g., by isolating a clone in a 

25 phage host genomic library, and sequencing the clone insert to provide the relevant 
coding region. The boundaries of the coding region can be identified by conventional 
sequence analysis and/or by expression in a bacterium in which the endogenous copy 
of the coding region has been inactivated and using subcloning to identify the 
functional start and stop codons for the coding region. 

30 In the context of nucleic acid or amino acid sequences of this invention, the 

term "corresponding" indicates that the sequence is at least 95% identical, preferably 
at least 97% identical, and more preferably at least 99% identical to a sequence from 
the specified phage genome, a ribonucleotide equivalent, a degenerate equivalent 
(utilizing one or more degenerate codons), or a homologous sequence, where the 

35 homolog provides functionally equivalent biological function. 

By "treatment" or "treating" is meant administering a compound or 
pharmaceutical composition for prophylactic and/or therapeutic purposes. The term 
"prophylactic treatment" refers to treating a patient or animal that is not yet infected 
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but is susceptible to or otherwise at risk of a bacterial infection. The term "therapeutic 
treatment" refers to administering treatment to a patient already suffering from 
infection. 

The term "bacterial infection" refers to the invasion of the host organism, 
5 animal or plant, by pathogenic bacteria. This includes the excessive growth of bacteria 
which are normally present in or on the body of the organism, but more generally, a 
bacterial infection can be any situation in which the presence of a bacterial 
population(s) is damaging to a host organism. Thus, for example, an organism suffers 
from a bacterial population when excessive numbers of a bacterial population are 

10 present in or on the organism's body, or when the effects of the presence of a bacterial 
population(s) is damaging to the cells, tissue, or organs of the organism. 

The terms "administer", "administering", and "administration" refer to a 
method of giving a dosage of a compound or composition, e.g., an antibacterial 
pharmaceutical composition, to an organism. Where the organism is a mammal, the 

15 method is, e.g., topical, oral, intravenous, transdermal, intraperitoneal, intramuscular, 
or intrathecal. The preferred method of administration can vary depending on various 
factors, e.g., the components of the pharmaceutical composition, the site of the 
potential or actual bacterial infection, the bacterium involved, and the infection 
severity. 

20 The term "mammal" has its usual biological meaning referring to any 

organism of the Class Mammalia of higher vertebrates that nourish their young with 
milk secreted by mammary glands, e.g., mouse, rat, and, in particular, human, bovine, 
sheep, swine, dog, and cat. 

In the context of treating a bacterial infection a "therapeutically effective 

25 amount" or "pharmaceutical^ effective amount" indicates an amount of an 

antibacterial agent, e.g., as disclosed for this invention, which has a therapeutic effect. 
This generally refers to the inhibition, to some extent, of the normal cellular 
functioning of bacterial cells that renders or contributes to bacterial infection. 
The dose of antibacterial agent that is useful as a treatment is a 

30 "therapeutically effective amount." Thus, as used herein, a therapeutically effective 
amount means an amount of an antibacterial agent that produces the desired 
therapeutic effect as judged by clinical trial results and/or animal models. This amount 
can be routinely determined by one skilled in the art and will vary depending on 
several factors, such as the particular bacterial strain involved and the particular 

35 antibacterial agent used. 

In connection with claims to methods of inhibiting bacteria and therapeutic or 
prophylactic treatments, "a compound active on a target of a bacteriophage inhibitor 
protein" or terms of equivalent meaning differ from administration of or contact with 
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an intact phage naturally encoding the full-length inhibitor compound. While an 
intact phage may conceivably be incorporated in the present methods, the method at 
least includes the use of an active compound as specified different from a full length 
inhibitor protein naturally encoded by a bacteriophage and/or a delivery or contacting 
5 method different from administration of or contact with an intact phage encoding the 
full-length protein. Similarly, pharmaceutical compositions described herein at least 
include an active compound different from a full-length inhibitor protein naturally 
encoded by a bacteriophage or such a full-length protein is provided in the 
composition in a form different from being encoded by an intact phage. Preferably 

10 the methods and compositions do not include an intact phage. 

In accord with the above aspects, the invention also provides antibacterial 
agents and compounds active on bacterial targets of bacteriophage inhibitor proteins 
or RNAs, where the target was uncharacterized as indicated above. As previously 
indicated, such active compounds include both novel compounds and compounds 

15 which had previously been identified for a purpose other than inhibition of bacteria. 
Such previously identified biologically active compounds can be used in 
embodiments of the above methods of inhibiting and treating. In preferred 
embodiments, the targets, bacteriophage, and active compound are as described herein 
for methods of inhibiting and methods of treating. Preferably the agent or compound 

20 is formulated in a pharmaceutical composition which includes a pharmaceutically 
acceptable carrier, excipient, or diluent. In addition, the invention provides agents, 
compounds, and pharmaceutical compositions where an active compound is active on 
an uncharacterized phage-specific site. 

In preferred embodiments, the target is as described for embodiments of 

25 aspects above. 

Likewise, the invention provides a method of making an antibacterial agent. 
The method involves identifying a target of a bacteriophage inhibitor polypeptide or 
protein or RNA, screening a plurality of compounds to identify a compound active on 
the target, and synthesizing the compound in an amount sufficient to provide a 

30 therapeutic effect when administered to an organism infected by a bacterium naturally 
producing the target. In preferred embodiments, the identification of the target and 
identification of active compounds include steps or methods and/or components as 
described above (or otherwise herein) for such identification. Likewise, the active 
compound can be as described above, including fragments and derivatives of phage 

35 inhibitor proteins, peptidomimetics, and small molecules. As recognized by those 
skilled in the art, peptides can be synthesized by expression systems and purified, or 
can be synthesized artificially. 
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As indicated above, sequence analysis of nucleotide and/or amino acid 
sequences can beneficially utilize computer analysis. Thus, in additional aspects the 
invention provides computer-related hardware and media and methods utilizing and 
incorporating sequence data from uncharacterized phage, e.g., uncharacterized phage 
5 listed in Table 1, preferably at least one of bacteriophage 77, 3 A, and 96, 

(Staphylococcus aureus phage). In general, such aspects can facilitate the above 
described aspects. Various embodiments involve the analysis of genetic sequence and 
encoded products, as applied to the evaluating bacteriophage inhibitor ORFs and 
compounds and fragments related thereto. The various sequence analyses, as well as 
10 function analyses, can be used separately or in combination, as well as in preceding 
aspects and embodiments. Use in combination is often advantageous as the additional 
information allows more efficient prioritizing of phage ORFs for identification of 
Q those ORFs that provide bacteria-inhibiting function. 

•|f In one aspect, the invention provides a computer-readable device which 

(t\ 15 includes at least one recorded amino acid or nucleotide sequence corresponding to one 
J; of the specified phage and a sequence analysis program for analyzing a nucleotide 

; jj and/or amino acid sequence. The device is arranged such that the sequence 

fU information can be retrieved and analyzed using the analysis program. The analysis 

J\ can identify, for example, homologous sequences or the indicated %s of the phage 

ft! 20 genome and structural motifs. Preferably the sequence includes at least 1 phage ORF 
|j or encoded product, more preferably at least 10%, 20%, 30%, 40%, 50%, 70%, 90%, 

yp or 100% of the genomic phage ORFs and/or equivalent cDNA, RNA, or amino acid 

^ sequences. Preferably the sequence or sequences in the device are recorded in a 

medium such as a floppy disk, a computer hard drive, an optical disk, computer 
25 random access memory (RAM), or magnetic tape. The program may also be recorded 

in such medium. The sequences can also include sequences from a plurality of 

different phage. 

In this context, the term "corresponding" indicates that the sequence is at least 
95% identical, preferably at least 97% identical, and more preferably at least 99% 

30 identical to a sequence from the specified phage genome, a ribonucleotide equivalent, 
a degenerate equivalent (utilizing one or more degenerate codons), or a homologous 
sequence, where the homolog provides functionally equivalent biological function. 

Similarly, the invention provides a computer analysis system for identifying 
biologically important portions of a bacteriophage genome. The system includes a 

35 data storage medium, e.g., as identified above, which has recorded thereon a 

nucleotide sequence corresponding to at least a portion of at least one uncharacterized 
bacteriophage genome, a set of program instructions to allow searching of the 
sequence or sequences to analyze the sequence, and an output device where the 
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portion includes at least the sequence length as specified in the preceding aspect. The 
output device is preferably a printer, a video display, or a recording medium. More 
one than one output device may be included. For each of the present computer-related 
asepcts, the bacteriophage are preferably selected from the uncharacterized phage 
listed in Table 1, more preferably from bacteriophage 77, 3 A, and 96. 

In keeping with the computer device aspects, the invention also provides a 
method for identifying or characterizing a bacteriophage ORF by providing a 
computer-based system for analyzing nucleotide or amino acid sequences, e.g., as 
describe above. The system includes a data storage medium which has recorded a 
sequences or sequences as described for the above devices, a set of instructions as in 
the preceding aspect, and an output device as in the preceding aspect. The method 
further involves analyzing at least one sequence, and outputting the analysis results to 
at least one output device. 

In preferred embodiments, the analysis identifies a sequence similarity or 
homology with a sequence or sequences selected from bacterial ORFs encoding 
products with related biological function; ORFs encoding known inhibitors; and 
essential bacterial ORFs. Preferably the analysis identifies a probable biological 
function based on identification of structural elements or characteristic or signature 
motifs of an encoded product or on sequence similarity or homology. Preferably the 
uncharacterized bacteriophage is from Table 1, more preferably at least one of 
bacteriophage 77, 3A, and 96. In preferred embodiments, the method also involves 
determining at least a portion of the nucleotide sequence of at least one 
uncharacterized bacteriophage as indicated, and recording that sequence on data 
storage medium of the computer-based system. 

As used in the claims to describe the various inventive aspects and 
embodiments, "comprising" means including, but not limited to, whatever follows the 
word "comprising". Thus, use of the term "comprising" indicates that the listed 
elements are required or mandatory, but that other elements are optional and may or 
may not be present. By "consisting of 1 is meant including, and limited to, whatever 
follows the phrase "consisting of 1 . Thus, the phrase "consisting of 1 indicates that the 
listed elements are required or mandatory, and that no other elements may be present. 
By "consisting essentially of is meant including any elements listed after the phrase, 
and limited to other elements that do not interfere with or contribute to the activity or 
action specified in the disclosure for the listed elements. Thus, the phrase "consisting 
essentially of indicates that the listed elements are required or mandatory, but that 
other elements are optional and may or may not be present depending upon whether or 
not they affect the activity or action of the listed elements. 
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Further embodiments will be apparent from the following Detailed Description 
and from the claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 

5 

FIGURE 1 A and IB are flow schematics showing the manipulations necessary to 
convert pT0021, an arsenite inducible vector containing the luciferase gene, into 
pTHA or pTM, two ars inducible vectors. Vector pTHA contains BamH I, Sal I, and 
Hind III cloning sites and a downstream HA epitope tag. Vector pTM contains Bam 
1 0 HI and Hind III cloning sites and no HA epitope tag. 



FIGURE 2 is a schematic representation of the cloning steps involved to place the 
DNA segments of any of ORFs 17/ 19/ 43/ 102/104/182 or other sequences into 
pTHA to assess inhibitory potential For subcloning into pTM or pT0021, Individual 

1 5 ORFs were amplified by the PCR using oligonucleotides targeting the ATG and stop 
codons of the ORFs. Using this strategy, Bam HI and Hind III sites were positioned 
immediately upstream or downstream, respectively of the start and stop codons of 
each ORF. Following digestion with Bam HI and Hind III, the PCR fragments were 
subcloned into the same sites of pT0021 or pTM. Clones were verified by PCR and 

20 direct sequencing. 

FIGURE 3 shows a schematic representation of the functional assays used to 
characterize the bactericidal and bacteriostatic potential of all predicted ORFs (>33 
amino acids) encoded by bacteriophage 77. Fig. 3 A) Functional assay on semi-solid 
25 support media. Fig. 3B) Functional assay in liquid culture. 



FIGURE 4A, B, and C is a bar graph showing the results of a screen in liquid media 
to assess bacteriostatic or bactericidal activity of 93 predicted ORFs (>33 amino 
acids) encoded by bacteriophage 77. Growth inhibition assays were performed as 
30 detailed in the Detailed Description. The relative growth of Staphylococcus aureus 
transformants harboring a given bacteriophage 77 ORF (identified on the bottom of 
the graph), in the absence or presence of arsenite, is plotted relative to growth of a 
Staphylococcus aureus transformant containing ORF 5, a non-toxic bacteriophage 77 
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ORF (which is set at 100%). Each bar represents the average obtained from three 
Staph A transformants grown in duplicate. Bacteriophage 77 ORFs showing 
significant growth inhibition are plotted in red and consist or ORF 17, 19, 102, 104, 
and 182. 

FIGURE 5 shows a block diagram of major components of a general purpose 
computer. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

10 

The invention may be more clearly understood from the following description. 

The tables will first be briefly described. 
^1 Table 1 is a listing of a large number of available bacteriophage that can be 

=f~ readily obtained and used in the present invention. 

; 1 1 1 5 Table 2 shows the complete nucleotide sequence of the genome of 

1 ^ Staphylococcus aureus bacteriophage 77. 

M* Table 3 shows a list of all the ORFs from Bacteriophage 77 that were screened 

Ji{ in the functional assay to identify those with anti-microbial activity. 

Ill Table 4 shows the predicted nucleotide sequence, predicted amino acid 

J5 20 sequence, and physiochemical parameters of ORF 17/19/ 43/ 1 02/ 1 04/ 1 82] . These 
include the primary amino acid sequence of the predicted protein, the average 
molecular weight, amino acid composition, theoretical pi, hydrophobicity map, and 
predicted secondary structure map. 

Table 5 shows homology search results. BLAST analysis was performed with 
25 ORFs 17/ 19/ 43/ 102/ 104/ 182 against NCBI non-redundant nucleotide and 
Swissprot databases. The results of this search indicate that: I) ORF 17 has no 
significant homology to any gene in the NCBI non-NCBI non-redundant nucleotide 
database, II) ORF 19 has significant homology to one gene in the NCBI non- 
redundant nucleotide database - the gene encoding ORF 59 of bacteriophage phi PVL, 
30 III) ORF 43 has significant homology to one gene in the NCBI non-redundant 
nucleotide database - the gene encoding ORF 39 of phi PVL, IV) ORF 102 has 
significant homology to one gene in the NCBI non-redundant nucleotide database - 
the gene encoding ORF 38 of phi PVL, V) ORF 104 has no significant homology to 
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any gene in the NCBI non-redundant nucleotide database, VI) ORF 182 has 
significant homology to one gene in the NCBI non-redundant nucleotide database - 
the gene encoding ORF 39 of phi PVL. 

Table 6 is a table from Alberts et al., MOLECULAR BIOLOGY OF THE 
5 CELL 3 rd ed., showing the redundancy of the "universal" genetic code. 

Table 7 shows the complete nucleotide sequence of Staphylococcus aureus 
bacteriophage 3A. 

Table 8 is a listing of the ORFs identified in Staphylococcus aureus 
bacteriophage 3A. 

10 Table 9 shows the complete nucleotide sequence of Staphylococcus aureus 

bacteriophage 96. 

Table 10 is a listing of the ORFs identified in Staphylococcus aureus 
bacteriophage 96. 

Table 1 1 is a listing of sequences deposited in the NCBI public database 
1 5 (GeneBank) for bacteriophage listed in Table 1 . 

Table 12 is a listing of phage which encode a known lysis function , including 
the identified lysis gene. 

Table 13 is a listing of bacteriophage which encode holin genes, where holin 
genes encode proteins which form pores and eventually enable other enzymes to kill 
20 the host bacterium. 

Table 14 is a listing of bacteriophage which encode kil genes. 

Table 15 is a list of Staphylococcus aureus sequences which may include 
sequences from genes coding for target sequences for the phage 77-encoded 
antimicrobial proteins or peptides. 

25 

Background: 

As indicated in the Summary above, the present invention is concerned with 
the use of bacteriophage coding sequences and the encoded polypeptides or RNA 
transcripts to identify bacterial targets for potential new antibacterial agents. Thus, 

30 the invention concerns the selection of relevant bacteria. Particularly relevant bacteria 
are those which are pathogens of a complex organism such as an animal, e.g., 
mammals, reptiles, and birds, and plants. However, the invention can be applied to 
any bacterium (whether pathogenic or not) for which bacteriophage are available or 
which are found to have cellular components closely homologous to components 

35 targeted by phage of another bacterium, e.g., a pathogenic bacterium, e.g., a 
pathogenic bacterium. 
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Thus, the invention also concerns the bacteriophage which can infect a 
selected bacterium. Identification of ORFs or products from the phage which inhibit 
the host bacterium both provides an inhibitor compound and allows identification of 
the bacterial target affected by the phage-encoded inhibitor. Such targets are thus 
5 identified as potential targets for development of other antibacterial agents or 
inhibitors and the use of those targets to inhibit those bacteria. As indicated above, 
even if such a target is not initially identified in a particular bacterium, such a target 
can still be identified if a homologous target is identified in another bacterium. 
Usually, but not necessarily, such another bacterium would be a genetically closely 

10 related bacterium. Indeed, in some cases, a phage-encoded inhibitor can also inhibit 
such a homologous bacterial cellular component. 

The demonstration that bacteriophage have adapted to inhibiting a host 
bacterium by acting on a particular cellular component or target provides a strong 
indication that that component is an appropriate target for developing and using 

15 antibacterial agents, e.g., in therapeutic treatments. Thus, the present invention 
provides additional guidance over mere identification of bacterial essential genes, as 
the present invention also provides an indication of accessability of the target to an 
inhibitor, and an indication that the target is sufficiently stable over time {e.g., not 
subject to high rates of mutation) as phage acting on that target were able to develop 

20 and persist. Thus, the present invention identifies a subset of essential cellular 
components which are particularly likely to be appropriate targets for development of 
antibacterial agents. 

The invention also, therefore, concerns the development or identification of 
inhibitors of bacteria, in addition to the phage-encoded inhibitory proteins (or RNA 

25 transcripts), which are active on the targets of bacteriophage-encoded inhibitors. As 
described herein, such inhibitors can be of a variety of different types, but are 
preferably small molecules. 

The following description provides preferred methods for developing the 
various aspects of the invention. However, as those skilled in the art will readily 

30 recognize, other approaches can be used to obtain and process relevant information. 
Thus the invention is not limited to the specifically described methods. In addition, 
the following description provides a set of steps in a particular order. That series of 
steps describes the overall development involved in the present invention. However, 
it is clear that individual steps or portions of steps may be usefully practiced 

35 separately, and, further, that certain steps may be performed in a different order or 
even bypassed if appropriate information is already available or is provided by other 
sources or methods. 
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Selecting and Growing Phage, and Isolating DNA 

Conceptually, the first step involves selecting bacterial hosts of interest. 
Preferably, but not necessarily, such hosts will be pathogens of clinical importance. 
Alternatively, because bacteria all share certain fundamental metabolic and structural 
5 features, these features can be targeted for study in one strain, for example a 
nonpathogenic one, and extrapolated to similarly succeed in pathogenic ones. 
Nonpathogenic strains may also exhibit initial advantages in being not only less 
dangerous, but also, for example, in having better growth and culturing characteristics 
and/or better developed molecular biology techniques and reagents. Consequently, 
10 advantageously the invention provides the ability target virtually any bacteria, but 
preferably pathogenic bacteria, with antimicrobial compounds designed and/or 
developed using bacteriophage inhibitory proteins and peptides from phage with non- 
p pathogenic and/or pathogenic hosts. 

^ We have selected Staphylococcus aureus, Streptococcus pneumoniae, various 

15 Enterococci, and Pseudomonas aeruginosa as initial exemplary pathogens. These 
f * bacteria are a major cause of morbidity and mortality in hospital-based infections, and 

I?? the appearance of antibiotics resistance in all three organisms makes it increasingly 

III difficult to treat benign infections involving these organisms. Such infections can 

f , include, for example, otitis media, sinusitis, and skin, and airway infections (Neu, 

f|j 20 H.C. (1992). Science 257, 1064-1073). However, the approach described below is 
y clearly applicable to any human bacterial pathogens including but not restricted to 

,f: Mycobacterium tuberculosis, Nesseria gonorrhoeae, Haemophilus influenza, 

tfl Acinobacter, Escherichia coli, Shigella dysenteria, Streptococcus pyogenes, 

Helicobacter pylori, and Mycoplasma species. This invention can also be applied to 
25 the discovery of anti-bacterial compounds directed against pathogens of animals other 
than humans, for example, sheep, cattle, swine, dogs, cats, birds, and reptiles. 
Similarly, the invention is not limited to animals, but also applies to plants. 

The bacteria are grown according to standard methodologies employed in the 
art, including solid, semi-solid or liquid culturing, which procedures can be found in 
30 or extrapolated from standard sources such as Maloy, S.R., Stewart, V.J., and Taylor, 
R.K. Genetic Analysis of Pathogenic Bacteria (1996) Cold Spring Harbor Laboratory 
Press, or Maniatis, T. et al. (1989) Molecular Cloning: A Laboratory Manual Cold 
Spring Harbor University Press, Cold Spring, N.Y.; or Ausubel, F.M. et al. (1994) 
Current Protocols in Molecular Biology . John Wiley & Sons, Secaucus, NJ. Culture 
35 conditions are selected which are adapted to the particular bacterium generally using 
culture conditions known in the art as appropriate, or adaptations of those conditions. 

Nucleic acids within these bacteria can be routinely extracted through common 
procedures such as described in the above-referenced manuals and as generally known 
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to those skilled in the art. Those nucleic acid stocks can then be used to practice the 
other inventive aspects described below. 

Selection and Growth of Bacteriophage, and Isolation of DNA 
5 The second step involves assembling a group of bacteriophages (phage 

collection) for each of the targeted bacterial hosts. While the invention can be utilized 
with a single bacteriophage for a pathogen or other bacterium, it is preferable to utilize 
a plurality of phage for each bacterium, as comparisons between a plurality of such 
phage provides useful additional information. Non-limiting examples of phage and 

10 sources for some of the above-mentioned pathogenic bacteria are found in Table 1. 
The criteria used to select such phages is that they are infectious for the microbe 
targeted, and replicate in, lyse, or otherwise inhibit growth of the bacterium in a 
measurable fashion. These phages can be very different from one another 
(representing different families), as judged by criteria such as morphology (head, tail, 

15 plate, etc.), and similarity of genome nucleotide sequence (cross-hybridization). Since 
such diverse bacteriophages are expected to block bacterial host metabolism and 
ultimately inhibit by a variety of mechanisms, their combined study will lead to the 
identification of different mechanisms by which the phages independently inhibit 
bacterial targets. Examples include degradation of host DNA (Parson K.A., and 

20 Snustad, D.P. (1975). J. Virol 15, 221-444) and inhibition of host UNA transcription 
(Severinova, E., Severinov, K. and Darst, S.A. (1998/ JMol Biol 279, 9-18). This, 
in turn, yields novel information on phage proteins that can inhibit the targeted 
microbe. As explained below, this 1) forms the basis of novel drug discovery efforts 
based on knowledge of the primary amino acid sequence of the phage inhibitor 

25 protein (e.g., peptide fragments or peptidomimetics) and/or 2) leads to the 
identification of bacterial biochemical pathways, the proteins of which are essential or 
significant for survival of the targeted microbe, and which enzymatic steps or 
chemical reactions can be targeted by classical drug discovery methods using 
molecular inhibitors, for example, small molecule inhibitors. 

30 Bacteriophage are generally either of two types, lytic or filamentous, meaning 

they either outright destroy their host and seek out new hosts after replication, or else 
continuously propogate and extrude progeny phage from the same host without 
destroying it. Regardless of the phage life cycle and type, preferred embodiments 
incorporate phage which impede cell growth in measurable fashion and preferably 

35 stop cell growth. To this end, lytic phage are preferred, although certain nonlytic 
species may also suffice, e.g., if sufficiently bacteriostatic. 

Various procedures that are commonly understood by those of skill in the art 
can be routinely employed to grow, isolate, and purify phage. Such procedures are 
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exemplified by those found in such common laboratory aids such as Maloy, S.R., 
Stewart, V.J., and Taylor, R.K. Genetic Analysis of Pathogenic Bacteria (1996) Cold 
Spring Harbor Laboratory Press; Maniatis, T. et al. (1989) Molecular Cloning: A 
Laboratory Manual Cold Spring Harbor University Press, Cold Spring, N.Y.; and 
5 Ausubel, F.M. et al. (eds.) (1994) Current Protocols in Molecular Biology . John 
Wiley & Sons, Secaucus, NJ. The techniques generally involve the culturing of 
infected bacterial cells that are lysed naturally and/or chemically assisted, for 
example, by the use of an organic solvent such as chloroform that destroys the host 
cells thereby liberating the phage within. Following this, the cellular debris is 
10 centrifiiged away from the supernatant containing the phage particles, and the phage 
then subsequently and selectively precipitated out of the supernatant using various 
methods usually employing the use of alcohols and/or other chemical compounds 
g| such as polyethylene glycol (PEG). The resulting phage can be further purified using 

various density gradient/centrifiigation methodologies. The resulting phage are then 

ss?SR 

111 15 chemically lysed, thereby releasing their nucleic acids that can be conveniently 
4; precipitated out of the supernatant to yield a viral nucleic acid supply of the phage of 

,11 interest. 

Oil Exemplary bacteriophage are indicated in Table 1, along with sources where 

jj\ those phage may be obtained. 

Ill 20 Exemplary bacteria include the reference bacteria for the identified viral 

strains, available from the same sources. 

"J- * 

^ Characterizing Bacteriophage Genomes for ORFs 

The third step involves systematically characterizing the genetic information 

25 contained in the phage genome. Within this genetic information is the sequence of all 
RNAs and proteins encoded by the phage, including those that are essential or 
instrumental in inhibiting their host. This characterization is preferably done in a 
systematic fashion. For example, this can be done by first isolating high molecular 
weight genomic DNA from the phage using standard bacterial lysis methods, followed 

30 by phage purification using density gradient ultracentrifugation, and extraction of 
nucleic acid from the purified phage preparation. The high molecular weight DNA is 
then analyzed to determine its size and to evaluate a proper strategy for its sequencing. 
The DNA is broken down into smaller size fragments by sonication or partial 
digestion with frequently cutting restriction enzymes such as Sau3A to yield 

35 predominantly 1 to 2 kilobase length DNA, which DNA can then be resolved by gel 
electrophoresis followed by extraction from the gel. 

The ends of the fragments are enzymatically treated to render them suitable for 
cloning and the pools of fragments are cloned in a bacterial plasmid to generate a 
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library of the phage genome. Several hundred of these random DNA fragments 
contained in the plasmid vector are isolated as clones after introduction into an 
appropriate bacterium, usually Escherichia coli. They are then individually expanded 
in culture and the DNA from each individual clone is purified. The nucleotide 
5 sequences of the inserts of these clones are determined by standard automated or 
manual methods, using oligonucleotide primers located on either side of the cloning 
site to direct polymerase mediated sequencing (e.g., the Sanger sequencing method or 
a modification of that method). Other sequencing methods can also be used. 

The sequence of individual clones is then deposited in a computer, and 

10 specific software programs (for example Sequencher™, Gene Codes Corp.) are used 
to look for overlap between the various sequences, resulting in ordering of contig 
sequences and ultimately providing the complete sequence of the entire bacteriophage 
genome (one such example is given in Table 2 for Staphylococcus aureus 
bacteriophage 77). This complete nucleotide sequence is preferably determined with 

15 a redundancy of 3- to 5-fold (number of independent sequencing events covering the 
same region) in order to minimize sequencing errors. 

Preferably, the bacterial strain used as a phage host should not possess any 
other innate plasmids, transposons, or other phage or incompatible sequences that 
would complicate or otherwise make the various manipulations and analyses more 

20 difficult. 

Commercially available computer software programs are used to translate the 
nucleotide sequence of the phage to identify all protein sequences encoded by the 
phage (hereafter called open reading frames or ORFs). As phages are known to 
transcribe their genome into RNA from both strands, in both directions, and 

25 sometimes in more than one frame for the same sequence, this exercise is done for 
both strands and in all six possible reading frames. As evolutionary constraints have 
forced the phage to conserve all of its vital protein sequences in as small a genome as 
possible, it is straightforward to identify all the proteins encoded by the phage by 
simple examination of the 6 translation frames of the genome. Once these ORFs are 

30 identified, they are cataloged into a phage proteome database (Table 3 lists ORFs 

identified from phage 77). This analysis is preferably performed for each phage under 
study. The process of ORF identification can be varied depending on the desired 
results. For example, the minimum length for the putative encoded polypeptide can 
be varied, and/or putative coding regions that have an associated Shine-Dalgarno 

35 sequence can be selected. In the case of phage 77 ORFs, such parameter adjustment 
was performed and resulted in the identification of ORFs as listed herein. Different 
parameters had resulted in the identification of the ORFs listed in the preceding U.S. 
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Provisional Application 60/1 10,992, filed December 3, 1998, which is hereby 
incorporated by reference in its entirety. 

Correlation of exemplary ORFs identified in that provisional application and 
as identified herein are shown in the following table: 
5 . 



ORFID 


Genomic 


a.a. 


Start 


ORFID 


Genomic 


a.a. 


Start 


from 


position 


size 


codon 


from 


position 


size 


codon 


60/110,992 








09/407,804 








77ORF016 


2369-24024 


251 


TTG 


77ORF017 


23269-23982 


237 


ATG 


77ORF019 


39845-40501 


218 


ATA 


77ORF019 


39851-40501 


216 


ATG 


77ORF050 


29268-29564 


98 


ATG 


770RF182 


29268-29564 


98 


ATG 


77ORF050 


29268-29564 


98 


ATG 


77ORF043 


29304-29564 


86 


ATG 


77ORF067 


34312-34551 


79 


CTG 


77ORF104 


34393-34551 


52 


ATG 


770RF146 


29051-29212 


53 


ATG 


77ORF102 


29051-29212 


53 


ATG 



Identifying and Characterizing Inhibitory Phage ORFs 

The fourth step entails identifying the phage protein or proteins or RNA 

10 transcripts that have the ability to inhibit their bacterial hosts. This can be 
accomplished, for example, by either or both of two non-mutually exclusive methods. 
The first method makes use of bioinformatics. Over the past few years, a large amount 
of nucleotide sequence information and corresponding translated products have 
become available through large genome sequencing projects for a variety of 

15 organisms including mammals, insects, plants, unicellular eukaryotes (yeast and 
fungi), as well as several bacterial genomes such as E. coli, Mycobacterium 
tuberculosis, Bacillus subtilis, Staphylococcus aureus and many others. Such 
sequences have been deposited in public databases (for example, non-redundant 
sequence database at GenBank and SwissProt protein sequence database) 

20 (http://www.ncbi.nlm.nih.gov)) and can be freely accessed to compare any specific 
query sequence to those present in such databases. For example, GenBank contains 
over 1.6 billion nucleotides corresponding to 2.3 million sequence records. Several 
computer programs and servers (e.g., TBLASTN) have been created to allow the rapid 
identification of homology between any given sequence from one organism to that of 

25 another present in such databases, and such programs are public and available free of 
charge. 

In addition, it has been well established that basic biochemical pathways can 
be conserved in very distant organisms (for example bacteria and man), and that the 
proteins performing the various enzymatic steps in these pathways are themselves 
30 conserved at the amino acid sequence level. Thus, proteins performing similar 
functions (e.g. DNA repair, RNA transcription, RNA translation) have frequently 
preserved key structural signatures, identifiable by similarities across regions of 

SD-63311.4 



248/037 



proteins (domains and motifs). The antimicrobials of the present invention will 
preferably target features and targets that are highly characteristic or conserved in 
microbes, and not higher organisms. 

Most genomes encode individual proteins or groups of proteins that can be 
5 assembled into protein families that have been evolutionary conserved. Therefore, 
similarity between a new query sequence and that of a member of a protein family 
(reference sequences from public databases) can immediately suggest a biochemical 
function for the novel query sequence, which in our case is a phage ORF. 

The sequence homology between individual members of evolutionarily distant 
10 members of a protein family is usually not randomly distributed along the entire 
length of the sequence but is often clustered into "motifs". These correspond to key 
three-dimensional folds that form key catalytic and/or regulatory structures that 
qj perform key biochemical function(s) for the group of proteins. Commercially 

available computer software programs can identify such motifs in a new query 
III 15 sequence, again providing functional information for the query sequence. Such 
structural and functional motifs have also been derived from the combined analysis of 
primary sequence databases (protein sequences) and protein structure databases (X- 
ray crystallography, nuclear magnetic resonance) using so-called "threading" methods 
(RostB,land Sander C. (1996) Ann. Rev. Biophy. Biomol Struct 25,113-136). 
20 Such motifs and folds are themselves deposited in public databases which can 

be directly accessed (for example, SwissProt database; 3D-ALI at EMBL, Heidelberg; 
PROSITE). This basic exercise leads to a structural homology map in which each of 
the phage ORFs has been probed for such similarities, and where initial structural and 
functional hits are identified (selected examples of sequence homologies detected 
25 between individual ORFs from the genome of Staphylococcus aureus bacteriophage 
77 and sequences deposited in public databases are shown in Table 5; listed are the 
proteins showing homologies and the TBLASTN scores quantifying the degree of 
sequence similarity between the two compared sequences). 

This analysis can point out phage proteins with similarity to proteins from 
30 other phages (such as those for E. coli) playing an important role in the basic 
biochemical pathways of the phage (such as DNA replication, RNA transcription, 
tRNAs, coat protein and assembly). Selected examples of such proteins are shown in 
Tble 5. Therefore, this analysis enables identification and elimination of non-essential 
ORFs as candidates for an inhibitor function, as well as the identification of 
35 (potentially) useful ones. 

In addition, this analysis can point out specific ORFs as possible inhibitor 
ORFs. For example these ORFs may encode proteins or enzymes that alter bacterial 
cell structure, metabolism or physiology, and ultimately viability. Examples of such 
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proteins present in the genome of Staphylococcus aureus bacteriophage 77 include 
orfl4 (deoxyuridine triphosphatase from bacteriophage T5), and orflS (sialidase). 

In addition, it is well known that bacterial and eukaryotic viruses can usurp 
pathways from their host in order to use them to their advantage in blocking host 
5 cellular pathways upon infection. The phage can achieve this, for example, by 
overexpressing part or whole host-related sequences which are themselves regulating 
or rate limiting in key biochemical pathways of the host. The identification of 
sequence similarity between phage ORFs and bacterial host genome sequences will be 
highly indicative of such a mechanism (Selected examples of such homologies are 
10 listed in Table 5, e.g. orf4 (homologous to autolysin), orf20 (hypothetical protein from 
Staphyloccus aureus) and or£29 (hypothetical protein from Staphyloccus aureus). 
These ORFs can be analyzed by a standard biochemical approach to directly test their 
£1 inhibitor functions {e.g. , as described below). 

*■ Alternatively, a homology search may reveal that a given phage ORF is related 

15 to a protein present in the databases having an activity known to be inhibitory, {e.g. 
inhibitor of host RNA polymerase by E. coli bacteriophage T7. Such a finding would 
implicate the phage ORF product in a related activity. This will also suggest that a 
new antimicrobial could be derived by a mimetic approach {e.g., peptidomimetic) 
imitating this function or by a small molecule inhibitor to the bacterial target of the 
20 phage ORF, or any steps in the relevant host metabolic pathway, e.g., high throughput 
screening of small molecule libraries. Selected examples of such similarity between 
ORFs of Staphyloccus aureus bacteriophage 77 and proteins with inhibitor functions 
for bacterial hosts are listed in Table 5. These include orf9 (similar to bacteriophage 
PI kilA function), and orf4 (autolysin of Staphylococcus aureus, amidase enzymatic 
25 activity). 

A reason for the biochemical study of individual ORFs for inhibitor function is 
that their expression or overexpression will block cellular pathways of the host, 
ultimately leading to arrest and/or inhibition of host metabolism. In addition, such 
ORFs can alter host metabolism in different ways, including modification of 
30 pathogenicity. Therefore, individual ORFs identified above are expressed, preferably 
overexpressed, in the host and the effect of this expression or overexpression on host 
metabolism and viability is measured. This approach can be systematically applied to 
every ORF of the phage, if necessary, and does not rely on the absolute identification 
of candidate ORFs by bioinformatics. Individual ORFs are resynthesized from the 
35 phage genomic DNA, e.g., by the polymerase chain reaction (PCR), preferably using 
oligonucleotide primers flanking the ORF on either side. These single ORFs are 
preferably engineered so that they contain appropriate cloning sites at their extremities 
to allow their introduction into a new bacterial expression plasmid, allowing 
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propagation in a standard bacterial host such as E. coli, but containing the necessary 
information for plasmid replication in the target microbe such as S. aureus (hereafter 
referred to as shuttle vector). Shuttle vectors and their use are well known in the art. 

Such shuttle vectors preferably also contain regulatory sequences that allow 

5 inducible expression of the introduced ORF. As the candidate ORF may encode an 
inhibitor function that will eliminate the host, it is beneficial that it not be expressed 
prior to testing for activity. Thus, screening for such sequences when expressed in a 
constitutive fashion is less likely to be successful when the inhibitor is lethal. In the 
exemplary inducible system presented in Figures 1 A, IB, and 2, regulatory sequences 

10 from the ars operon of S. aureus are used to direct individual ORF expression in S. 
aureus. The ars operon encodes a series of proteins which normally mediate the 
extrusion of arsenite and other trivalent oxyanions from the cells when they are 
exposed to such toxic substances in their environment. The operon encoding this 
detoxifying mechanism is normally silent and only induced when arsenite-related 

15 compounds are present. (Tauriainen, S. et al. (1997) App. Env. Microbe Vol. 63, No. 
11, p. 4456-4461.) 

Therefore, individual phage ORFs can be expressed in S. aureus in an 
inducible fashion by adding to the culture medium non-toxic arsenite concentrations 
during the growth of individual S. aureus clones expressing such individual phage 

20 ORFs. Toxicity of the phage inhibitor ORF for the host is monitored by reduction or 
arrest of growth under induction conditions, as measured by optical density in liquid 
culture or after plating the induced cultures on solid medium. Subsequently, 
interference of the phage ORF with the host biochemical pathways ultimately leading 
to reduced or arrested host metabolism can be measured by pulse-chase experiments 

25 using radiolabeled precursors of either DNA replication, RNA transcription, or protein 
synthesis. 

Those skilled in the art are familiar with a variety of other inducible systems 
which can also be used for the controlled expression of phage ORFs, including, for 
example, lactose (see e.g., Stratagene's LacSwitch™II system; La Jolla, CA) and 
30 tetracycline-based systems (see, e.g. Clontech's Tet On/Tet Off™ system; Palo Alto, 
CA). The arsenite-inducible system described is further depicted in Figures 1A, IB, 
and 2. 

The selection or construction of shuttle vectors and the selection and use of 
inducible systems are well known and thus other shuttle vectors appropriate for other 
35 bacteria can be readily provided by those skilled in the art. 

Standard methodologies for expressing proteins from constructs, and isolating 
and manipulating those proteins, for example in cross-linking and affinity 
chromatography studies, may be found in various commonly available and known 
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laboratory manuals. See, e.g., Current Protocols in Protein Science, John Wiley & 
Sons, Secaucus, N.J., and Maniatis, T. et al (1989) Molecular Cl oning: A Laboratory 
Manual Cold Spring Harbor University Press, Cold Spring, N.Y. 

It has been found that certain phage or other viruses inhibit host cells, at least 
5 in part, by producing an antisense RNA which binds to and inhibits translation from a 
bacterial RNA seqeunce. Thus, in the case of potentially inhibitor RNA transcripts 
encoded by the phage genome, a strong indicator of a possible inhibitory function is 
provided by the identification of phage sequence which is the identical to or fully 
complementary (or with only a small percentage of mismatch, e.g., <10%, preferably 
10 less than 5%, most preferably less than 3%, to a bacterial sequence. This approach is 
convenient in the case of bacteria which have been essentially completely sequenced, 
as the comparison can be performed by computer using public database information. 
Q The inhibitory effect of the transcript can be confirmed using expression of the 

^ phage sequence in a host bacterium. If needed, such inhibitory can also be tested by 

\*\ 15 transfecting the cells with a vector which will transcribe the phage sequence to form 
|: RNA in such manner that the RNA produced will not be translated into a polypeptide. 

In Inhibition under such conditions provides a strong indication that the inhibition is due 

K to the transcript rather than to an encoded polypeptide. 

L In an alternative, the expression of an ORF in a host bacterium is found to be 

ry 20 inhibitory, but the inhibition if found to be due to an RNA product of the genomic 
J! coding region. For antisense inhibition, the sequence of the bacterial target nucleic 

i|| acid sequence can be identified by inspection of the phage sequence, and the full 

^6 sequence of the relevant coding region for the bacterial product can be found from a 

database of the bacterial genomic sequence or can be isolated by standard techniques 
25 {e.g., a clone in a genomic library can be isolated which contains the full bacterial 
ORF, and then sequenced). 

In either case, the identification of a target which is inhibited by an RNA 
transcript produced by a phage provides both the possible inhibition of bacteria 
naturally containing the same target nucleic acid sequence, as well as the ability to use 
30 the target sequence in screening for other types of compounds which will act directly 
on the target nucleic acid sequence or on a polypeptide product expressed or 
regulated, at least in part, by the target of the inhibitory phage RNA. 

In some cases it will be found that the target of an inhibitory phage RNA or 
protein has previously been found to be a target of an inhibitory phage RNA or 
35 protein has previously been found to be a target for an antibacterial agent. In such 
cases, the phage inhibitor can still provide useful information if it is found that the 
phage-encoded product acts at a different site than the previously identified 
antibacterial agent or inhibitor, i.e., acts at a phage-specific site. For many targets, 
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action at a different site provides highly beneficial characteristics and/or information. 
For example, an alternate site of inhibitor action can at least partially overcome a 
resistance mechanism in a bacterium. As an illustration, in many cases, resistance is 
due, in large part, to altered binding characteristics of the immediate target to the 
5 antibacterial agent. The altered binding is due to a structural change which prevents 
or destabilizes the binding. However, the structural change is frequently quite local, 
so that compounds which bind at different local sites will b unaffected or affected to a 
much lesser degree. Indeed, in some cases the local sites will be on a different 
molecule and so may be completely unaffected by the local structural change creating 
10 resistance to the original agent(s). An example of resistance due to altered binding is 
provided by methicillin-resistant Staphylococcus aureus, in which the resistance is 
due to an altered penicillin-binding protein. 

CI In other cases, a new site of action can have improved accessibility as 

compared to a site acted on by a previously identified agent. This can, for example, 

111 1 5 assist in allowing effective treatment at lower doses, or in allowing access by a larger 

J range of types of compounds, potentially allowing identification of more potential 

||! active agents. 

■* s Another advantage is that the structural characteristics of a different site of 

action will lead to identification and/or development of inhibitors with different 
W 20 structures and different pharmacological parameter. This can allow a greater range of 

possibilities when selecting an antibacterial agent. 
:i| Yet further, different sites often produce different inhibitory characteristics in 

* the target organism. This is commonly the case for multi-domain target proteins. 

Thus, inhibition targeting an alternate site can produce more efficacious action, e.g., 
25 faster killing, slower development of resistance, lower numbers of surviving cells, and 

different secondary effects (for example, different nutrient utilization). 

Validating Identified Inhibitory Phase ORFs 

A fifth step involves validating the identified phage inhibitor ORF by 
30 independent methods, and delineating further possible smaller segments of the ORFs 
that have inhibitory activity. Several methods exist to validate the role of the 
identified ORF as an inhibitor ORF. 

One example utilizes the creation of a mutant variant of the phage ORF in 
which the candidate ORF carries a partial or complete loss-of-function mutation that 
35 is measurable as compared with the non-mutant ORF. Comparison of the effects of 
expression of the loss of function mutant with the normal ORF provides confirmation 
of the identification of an inhibitor ORF where the loss of function mutant provides a 
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measurably lower level of inhibition, preferably no inhibition. The loss of function 
may be conditional, e.g., temperature sensitive. 

Once validation of the inhibitor ORF is achieved, a bi-directional deletion 
analysis can be carried out using the same experimental system to identify the 
5 minimal polypeptide segment that has inhibitor activity. This may be carried out by a 
variety of means, e.g., by exonuclease or PCR methodologies, and is used to 
determine if a relatively small segment of the ORF {i.e., the product of the ORF) still 
possesses inhibitory activity when isolated away from its native sequence. If so, a 
portion of the ORF encoding this "active portion" can be used as a template for the 
10 synthesis of novel anti-microbial agents and further allowing derivation of the peptide 
sequence, e.g., using modified peptides and/or peptidomimetics. 

In creation of certain peptidomimetics, the peptide backbone is transformed 
O into a carbon-based hydrophobic structure that can retain inhibitor activity against the 

■|f bacterium. This is done by standard medicinal chemistry methods, typically 

111 15 monitored by measuring growth inhibition of the various molecules in liquid cultures 
+; or on solid medium. These mimetics can also represent lead compounds for the 

y?i development of novel antibiotics. 

W Recently, a major effort has been undertaken by the pharmaceutical industry 

L and their biotechnology partners for the sequencing of bacterial pathogen genomes. 

Ft! 20 The rationale is that the systematic sequencing of the genome will identify all of the 
jj{ bacterial proteins and therefore this proteome will be the target for designing novel 

li inhibitor antibiotics. Although systematic, this approach has several major problems. 

* xh e fi rs t is that analysis of primary amino acid sequences of bacterial proteins does 

not immediately reveal which protein will be essential for viability of the bacterium, 
25 and target validation is thus a major issue. The second problem is one of redundancy, 
as several biochemical pathways are either structurally duplicated in bacteria 
(different isoforms of the same enzyme), or functionally duplicated by the presence of 
salvage pathways in the event of a metabolic block in one pathway (different 
nutritional conditions). The third is that even a valid target may not be structurally or 
30 functionally amenable to inhibition by small molecules because of inaccessibility 
(sequestration of target). 

Therefore, there is considerable interest within the pharmaceutical and 
biotechnology industry in identifying key targets for drug discovery amongst the mass 
of novel targets generated by large-scale genomic sequencing projects. 
35 On the other hand, and underscoring the instant invention, the phages herein 

described have, over millions of years, evolved specific mechanisms to target such 
key biochemical pathways and proteins. In the few cases where inhibition by phages 
has been elucidated {e.g., see ref. 3), such bacterial targets are invariably rate-limiting 
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in their respective biochemical pathways, are not redundant, and/or are readily 
accessible for inhibition by the phage (or by another inhibitory compound). 
Therefore, the sixth step of this invention involves identifying the host biochemical 
pathways and proteins that are targeted by the phage inhibitory mechanisms. 

5 

Identifying. Validating, and Characterizing Bacterial Host Tar get Proteins and 
Affected Pathways 

A rationale for this step is that the inhibitor ORF product from the phage 
physically interacts with and/or modifies certain microbial host components to block 
10 their function. Exemplary approaches which can be used to identify the host bacterial 
pathways and proteins that interact with, and preferably also are inhibited by, phage 
ORF product(s) are described below. 
% The first approach is a genetic screen to determine physiological 

if protein-.protein interaction, for example, using a yeast two hybrid system. In this 

J 15 assay, the phage ORF is fused to the carboxyl terminus of the yeast Gal4 activation 
= i: domain II (amino acids 768-881) to create a bait vector. A cDNA library of cloned S. 

ftj aureus sequences which have been engineered into a plasmid where the S. aureus 

sequences are fused to the DNA binding domain of Gal4 is also generated. These 
J plasmids are introduced alone, or in combination, into yeast strain Y190 - previously 

f!J 20 engineered with chromosomally integrated copies of the E. coli lacZ and the 
% selectable HIS3 genes, both under Gal4 regulation (Durfee, T., Becherer, K., Chen, P.- 

L., Yeh, S.-H., Yang, Y., Kilburn, A.E., Lee, W.-H., and Elledge, S.J. (1993). Genes 
& Dev. 7, 555-569). If the two proteins expressed in yeast interact, the resulting 
complex will activate transcription from promoters containing Gal4 binding sites. A 
25 lacZ and His3 gene, each driven by a promoter containing Gal4 binding sites, have 
been integrated into the genome of the host yeast system used for measuring protein- 
protein interactions. Such a system provides a physiological environment in which to 
detect potential protein interactions. This system has been extensively used to identify 
novel protein-protein interaction partners and to map the sites required for interaction 
30 (for example, to identify interacting partners of translation factors (Qiu, H., Garcia- 
Barrio, M.T., and Hinnebusch, A.G. (1998). Mol & Cell Biology 18, 2697-2711), 
transcription factors (Katagiri, T., Saito, H., Shinohara, A., Ogawa, H„ Kamada, N., 
Nakamura ,Y., and Miki, Y. (1998). Genes, Chromosomes & Cancer 21, 217-222), 
and proteins involved in signal transduction (Endo, T.A., Masuhara, M., Yokouchi, 
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M., Suzuki, R., Sakamoto, H., Mitsui, K., Matsumoto, A., Tanimura, S., Ohtsubo, 
M., Misawa,H., Miyazaki, T., Leonor N., Taniguchi, T., Fujita, T., Kanakura, Y., 
Komiya, S., and Yoshimura, A. Nature. 387, 921-924). This approach has also been 
used in many published reports to identify interaction between mammalian viral and 

5 mammalian cell proteins. 

For example, the non-structural protein NS1 of parvovirus is essential for viral 
DNA amplification and gene expression and is also the major cytopathic effector of 
these viruses. A yeast two-hybrid screen with NS1 identified a novel cellular protein 
of unknown function that interacts with NS-1, called SGT, for small glutamine-rich 

10 tetratricopeptide repeat (TPR)-containing protein (Cziepluch C. KordesE. PoireyR. 
GrewenigA. Rommelaere, J, and Jauniaux JC. (1998) J Virol 72, 4149-4156). In 
another screen, the adenovirus E3 protein was recently shown to interact with a novel 
tumor necrosis factor alpha-inducible protein and to modulate some of the activities of 
E3 (Li Y. Kang J. and Horwitz M.S. (1998). Mol & Cell Biol 18, 1601-1610). In yet 

15 another recent screen, the herpes simplex virus 1 alpha regulatory protein ICPO was 
found to interact with (and stabilize) the cell cycle regulator cyclin D3 (Kawaguchi Y. 
Van Sant C. and Roizman B. (1997). J Virol 71,7328-7336). 

Another two-hybrid system for identifying proteinrprotein interactions is 
commercially available from STRATEGENE™ as the CYTO-TRAP™ system 

20 (Chang et al, Strategies Newsletter 11(3), 65-68 (1998)(from Stratagene)). The 
system is a yeast-based method for detecting proteinrprotein interactions in vivo, using 
activation of the Ras signal transduction cascade by localizing a signal pathway 
component, human Sos (hSos), to its activation site in the yeast plasma membrane. 
The system uses a temperature-sensitive Saccharomyces cerevisiae mutant, strain 

25 cdc25H, which contains a point mutation at amino acid residue 1328 of the cdc25 
gene. This gene encodes a guanyl nucleotide exchange factor which binds and 
activates Ras, leading to cell growth. The mutation in the cdc25 gene prevents host 
growth at 37°C, but at a permissive temperature of 25°C, growth is normal. The 
system utilizes the ability of (hSos) to complement the cdc25 defect and activate the 

30 yeast Ras signaling pathway. Once (hSos) is expressed and localized to the plasma 
membrane, the cdc25H yeast strain grows at 37°C. Localizing hSos to the plasma 
membrane occurs through a protein:protein interaction. A protein of interest, or bait, 
is expressed as a fusion protein with hSos. The library, or target proteins are 
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expressed with the myristylation membrane-localization signal. The yeast cells are 
then incubated under restrictive conditions (37°C). If the bait and the target protein 
interact, the hSos protein is recruited to the membrane, activating the Ras signaling 
pathway and allowing the cdc25H yeast strain to grow at the restrictive temperature. 
5 The second approach is based on identifying protein:protein interactions 

between the phage ORF product and bacterial S. aureus, e.g., proteins using a 
biochemical approach based, for example, on affinity chromatography. This approach 
has been used, for example, to identify interactions between lambda phage proteins 
and proteins from their E. coli host (Sopta, M., Carthew, R.W., and Greenblatt, J. 

10 (1985) J. Biol. Chem. 260, 10353-10369). The phage ORF is fused to a peptide tag 
(e.g. glutathione-S-transferase ("GST"), 6xHIS, ("HIS") and/or calmodulin binding 
protein ("CPB")) within a commercially available plasmid vector that directs high 
level expression on induction of a suitably responsive promoter driving the fusion's 
expression. The translated fusion protein is expressed in E. coli, purified, and 

1 5 immobilized on a solid phase matrix via, for example the tag. Total cell extracts from 
the host bacterium, e.g., S. aureus, are then passed through the affinity matrix 
containing the immobilized phage ORF fusion protein; host proteins retained: on the 
column are then eluted under different conditions of ionic strength, pH, detergents 
etc., and characterized by gel electrophoresis and other techniques. Appropriate 

20 controls are run to guard against nonspecific binding to the resin. Target proteins thus 
recovered should be enriched for the phage protein/peptide of interest and are 
subsequently electrophoretically or otherwise separated, purified, sequenced, or 
biochemically analyzed. Usually sequencing entails individual digestion of the 
proteins to completion with a protease (e.g. -trypsin), followed by molecular mass and 

25 amino acid composition and sequence determination using, for example, mass 
spectrometry, e.g., by MALDI-TOF technology (Qin, J., Fenyo, D., Zhao, Y., Hall, 
W.W., Chao, D.M., Wilson, CJ., Young, R.A. and Chait, B.T. (1997). Anal. Chem. 
69, 3995-4001). 

The sequence of the individual peptides from a single protein are then 
30 analyzed by the bioinformatics approach described above to identify the S. aureus 
protein interacting with the phage ORF. This analysis is performed by a computer 
search of the S. aureus genome for an identified sequence. Alternatively, all tryptic 
peptide fragments of the S. aureus genome can be predicted by computer software, 
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and the molecular mass of such fragments compared to the molecular mass of the 
peptides obtained from each interacting protein eluted from the affinity matrix. The 
responsible gene sequence can be obtained, for example by using synthetic degenerate 
nucleic acid sequences to pull out the corresponding homologous bacterial sequence. 
5 Alternatively, antibodies can be generated against the peptide and used to isolate 
nascent peptide/mRNA transcript complexes, from which the mRNA can be reverse 
transcribed, cloned, and further characterized using the procedures discussed herein. 

A variety of other binding assay methods are known in the art and can be used 
to identify interactions between phage proteins and bacterial proteins or other bacterial 
1 0 cell components. Such methods which allow or provide identification of the bacterial 
component can be used in this invention for identifying putative targets. 
% Validation of the interaction between the phage ORF product and the bacterial 

HP proteins or other components can be obtained by a second independent assay (e.g., 

|: co-immunoprecipitation or protein-protein crosslinking experiments (Qiu, H. 5 Garcia- 

!}{ 15 Barrio, M.T., and Hinnebusch, A.G. (1998). Mol & Cell Biology 18, 2697-2711; 
nil Brown, S. and Blumenthal, T. (1976). Proa Natl Acad. Sci. USA 73, 1131-1135)). 

1^ Finally, the essential nature of the identified bacterial proteins is preferably 

J J determined genetically by creating a constitutive or inducible partial or complete loss- 

Ill of-function mutation in the gene encoding the identified interacting bacterial protein. 

% 20 This mutant is then tested for bacterial survival and replication. 

The protein target of the phage inhibitor function can also be identified using a 
genetic approach. Two exemplary approaches will be delineated here. The first 
approach involves the overexpression of a predetermined phage inhibitor protein in 
mutagenized host bacteria, e.g., S. aureus, followed by plating the cells and searching 
25 for colonies that can survive the inhibitor. These colonies will then be grown, their 
DNA extracted and cloned into an expression vector that contains a replicon of a 
different incompatibility group, and preferably having a different selectible marker 
than the plasmid expressing the phage inhibitor. Thus, host DNA fragments from the 
mutant that can protect the cell from phage ORF inhibition can be sequenced and 
30 compared with that of the bacterial host to determine in which gene the mutation lies. 
This approach allows rapid determination of the targets and pathways that are affected 
by the inhibitor. 
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Alternatively, the bacterial targets can be determined in the absence of 
selecting for mutations using an approach known as "multicopy suppression". In this 
approach, the DNA from the wild type host is cloned into an expression vector that 
can coexist, as previously described, with one containing a predetermined phage 
5 inhibitor. Those plasmids that contain host DNA fragments and genes that protect the 
host from the phage inhibitor can then be isolated and sequenced to identify putative 
targets and pathways in the host bacteria. 

Regardless of the specific mode of identification, screening assays may 
additionally utilize gene fusions to specific "reporter genes" to identify a bacterial 

1 0 gene(s) whose expression is affected when the host target pathway is affected by the 
phage inhibitor. Such gene fusions can be used to search a number of small molecule 
compounds for inhibitors that may affect this pathway and thus cause cell inhibition. 
This approach will allow the screening of a large number of molecules on petri dishes 
or 96-well format by monitoring for a simple color change in the bacterial colonies. 

1 5 In this manner, we can validate host targets and classes of compounds for further 
study and clinical development. These inhibitors also represent lead compounds for 
the development of other antibiotics. 

Bioinformatics and comparative genomics are preferably then applied to the 
identified bacterial gene products to predict biochemical function. The biochemical 

20 activity of the protein can be verified in vitro in cell free assays or in vivo in intact 
cells. In vitro biochemical assays utilizing cell-free extracts or purified protein are 
established as a basis for the screening and development of inhibitors. 

These inhibitors, preferably small molecule inhibitors, may comprise peptides, 
antibodies, products from natural sources such as fungal or plant extracts or small 

25 molecule organic compounds. In general, small molecule organic compounds are 
preferred. These compounds may, for example, be identified within large compound 
libraries, including combinatorial libraries. For example, a plurality of compounds, 
preferably a large number of compounds can be screened to determine whether any of 
the compounds binds or otherwise disrupts or inhibits the identified bacterial target. 

30 Compounds identified as having any of these activities can then be evaluated further 
in cell culture and/or animal model systems to determine the pharmacological 
properties of the compound, including the specific anti-microbial ability of the 
compound. 
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For mixtures of natural products, including crude preparations, once a 
preparation or fraction of a preparation is shown the have an anti-microbial activity, 
the active substance can be isolated and identified using techniques well known in the 
art, if the compound is not already available in a purified form. 
5 Identified compounds possessing anti-microbial activity and similar 

compounds having structural similarity can be further evaluated and, if necessary, 
derivatized according to synthesis and/or modification methods available in the art 
selected as appropriate for the particular starting molecule. 

10 Derivatization of identified anti-microbials 

In cases where the identified anti-microbials above might represent peptidal 
compunds, the in vivo effectiveness of such compounds may be advantageously 
enhanced by chemical modification using the natural polypeptide as a starting point 

1 5 and incorporating changes that provide advantages for use, for example, increased 
stability to proteolytic degradation, reduced antigenicity, improved tissue penetration, 
and/or improved delivery characteristics. 

In addition to active modifications and derivative creations, it can also be 
useful to provide inactive modifications or derivatives for use as negative controls or 

20 introduction of immunologic tolerance. For example, a biologically inactive 
derivative which has essentially the same epitopes as the corresponding natural 
antimicrobial can be used to induce immunological tolerance in a patient being 
treated. The induction of tolerance can then allow uninterrupted treatment with the 
active anti-microbial to continue for a significantly longer period of time. 

25 Modified anti-microbial polypeptides and derivatives can be produced using a 

number of different types of modifications to the amino acid chain. Many such 
methods are known to those skilled in the art. The changes can include, for example, 
reduction of the size of the molecule, and/or the modification of the amino acid 
sequence of the molecule. In addition, a variety of different chemical modifications of 

30 the naturally occurring polypeptide can be used, either with or without modifications 
to the amino acid sequence or size of the molecule. Such chemical modifications can, 
for example, include the incorporation of modified or non-natural amino acids or non- 
amino acid moieties during synthesis of the peptide chain, or the post-synthesis 
modification of incorporated chain moieties. 
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The oligopeptides of this invention can be synthesized chemically or through 
an appropriate gene expression system. Synthetic peptides can include both naturally 
occurring amino acids and laboratory synthesized, modified amino acids. 

Also provided herein are functional derivatives of anti-microbial proteins or 
5 polypeptides. By "functional derivative" is meant a "chemical derivative," 

"fragment," "variant," "chimera," or "hybrid" of the polypeptide or protein, which 
terms are defined below. A functional derivative retains at least a portion of the 
function of the protein, for example reactivity with a specific antibody, enzymatic 
activity or binding activity. 

1 0 A "chemical derivative" of the complex contains additional chemical moieties 

not normally a part of the protein or peptide. Such moieties may improve the 
molecule's solubility, absorption, biological half-life, and the like. The moieties may 
alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, and the like. Moieties capable of mediating 

1 5 such effects are disclosed in Alfonso and Gennaro (1995). Procedures for coupling 
such moieties to a molecule are well known in the art. Covalent modifications of the 
protein or peptides are included within the scope of this invention. Such 
modifications may be introduced into the molecule by reacting targeted amino acid 
residues of the peptide with an organic derivatizing agent that is capable of reacting 

20 with selected side chains or terminal residues, as described below. 

Cysteinyl residues most commonly are reacted with alpha-haloacetates (and 
corresponding amines), such as chloroacetic acid or chloroacetamide, to give 
carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl residues also are 
derivatized by reaction with bromotrifluoroacetone, chloroacetyl phosphate, N- 

25 alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p-chloro- 
mercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3- 
diazole. 

Histidyl residues are derivatized by reaction with diethylprocarbonate at pH 
5.5-7.0 because this agent is relatively specific for the histidyl side chain. Para- 
30 bromophenacyl bromide also is useful; the reaction is preferably performed in 0. 1 M 
sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues are reacted with succinic or other 
carboxylic acid anhydrides. Derivatization with these agents has the effect of 
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reversing the charge of the lysinyl residues. Other suitable reagents for derivatizing 
primary amine- containing residues include imidoesters such as methyl 
picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; 
trinitrobenzenesulfonic acid; O-methylisourea; 2,4 pentanedione; and transaminase- 
5 catalyzed reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional 
reagents, among them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and 
ninhydrin. Derivatization of arginine residues requires that the reaction be performed 
in alkaline conditions because of the high pK^ of the guanidine functional group. 
1 0 Furthermore, these reagents may react with the groups of lysine as well as the arginine 
alpha-amino group. 

Tyrosyl residues are well-known targets of modification for introduction of 
£ spectral labels by reaction with aromatic diazonium compounds or tetranitromethane. 

,|: Most commonly, N-acetylimidizol and tetranitromethane are used to form O-acetyl 

I % 15 tyrosyl species and 3-nitro derivatives, respectively. 

HI Carboxyl side groups (aspartyl or glutamyl) are selectively modified by 

yi reaction carbodiimide (R f -N-C-N-R f ) such as l-cyclohexyl-3-(2-morpholinyl(4-ethyl) 

J*[ carbodiimide or l-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, 

111 aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl residues 

% 20 by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues are 
deamidated under mildly acidic conditions. Either form of these residues falls within 
the scope of this invention. 
25 Derivatization with bifunctional agents is useful, for example, for cross- 

linking component peptides to each other or the complex to a water-insoluble support 
matrix or to other macromolecular carriers. Commonly used cross-linking agents 
include, for example, 1,1-bis (diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobi- 
30 functional imidoesters, including disuccinimidyl esters such as 3,3'- 

dithiobis(succimmidylpropionate), and bifunctional maleimides such as bis-N- 
maleimido-l,8-octane. Derivatizing agents such as methyl-3-[p-azidophenyl) 
dithiolpropioimidate yield photoactivatable intermediates that are capable of forming 
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crosslinks in the presence of light. Alternatively, reactive water-insoluble matrices 
such as cyanogen bromide-activated carbohydrates and the reactive substrates 
described in U.S. Patent Nos. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; 
and 4,330,440 are employed for protein immobilization. 
5 Other modifications include hydroxylation of proline and lysine, 

phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the 
alpha-amino groups of lysine, arginine, and histidine side chains (Creighton, T.E., 
Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)), acetylation of the N-terminal amine, and, in some instances, 
1 0 amidation of the C-terminal carboxyl groups. 

Such derivatized moieties may improve the stability, solubility, absorption, 
biological half life, and the like. The moieties may alternatively eliminate or attenuate 
% any undesirable side effect of the protein complex. Moieties capable of mediating 

JS such effects are disclosed, for example, in Alfonso and Gennaro (1995). 

s ji 15 The term "fragment 11 is used to indicate a polypeptide derived from the amino 

rtl acid sequence of the protein or polypeptide having a length less than the full-length 

m polypeptide from which it has been derived. Such a fragment may, for example, be 

!J: produced by proteolytic cleavage of the full-length protein. Preferably, the fragment 

111 is obtained recombinantly by appropriately modifying the DNA sequence encoding 

20 the proteins to delete one or more amino acids at one or more sites of the C-terminus, 
N-terminus, and/or within the native sequence. 

Another functional derivative intended to be within the scope of the present 
invention is a "variant" polypeptide which either lacks one or more amino acids or 
contains additional or substituted amino acids relative to the native polypeptide. The 
25 variant may be derived from a naturally occurring polypeptide by appropriately 

modifying the protein DNA coding sequence to add, remove, and/or to modify codons 
for one or more amino acids at one or more sites of the C-terminus, N-terminus, 
and/or within the native sequence. 

A functional derivative of a protein or polypeptide with deleted, inserted 
30 and/or substituted amino acid residues may be prepared using standard techniques 
well-known to those of ordinary skill in the art. For example, the modified 
components of the functional derivatives may be produced using site-directed 
mutagenesis techniques (as exemplified by Adelman et al, 1983, DNA 2:183; 
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Sambrook et al., 1989) wherein nucleotides in the DNA coding sequence are modified 
such that a modified coding sequence is produced, and thereafter expressing this 
recombinant DNA in a prokaryotic or eukaryotic host cell, using techniques such as 
those described above. Alternatively, components of functional derivatives of 
5 complexes with amino acid deletions, insertions and/or substitutions may be 

conveniently prepared by direct chemical synthesis, using methods well-known in the 
art. 

Insofar as other anti-microbial inhibitor compounds identified by the invention 
described herein may not be peptidal in nature, other chemical techniques exist to 
10 allow their suitable modification, as well, and according the desirable principles 
discussed above. 

Administration and Pharmaceutical Compositions 

1 5 For the therapeutic and prophylactic treatment of infection, the preferred 

method of preparation or administration of anti-microbial compounds will generally 
vary depending on the precise identity and nature of the anti-microbial being 
delivered. Thus, those skilled in the art will understand that administration methods 
known in the art will also be appropriate for the compounds of this invention. 

20 The particularly desired anti-microbial can be administered to a patient either 

by itself, or in pharmaceutical compositions where it is mixed with suitable carriers or 
excipient(s). In treating an infection, a therapeutically effective amount of an agent or 
agents is administered. A therapeutically effective dose refers to that amount of the 
compound that results in amelioration of one or more symptoms of bacterial infection 

25 and/or a prolongation of patient survival or patient comfort. 

Toxicity, therapeutic and prophylactic efficacy of anti-microbials can be 
determined by standard pharmaceutical procedures in cell cultures and/or 
experimental organisms such as animals, e.g., for determining the LD 50 (the dose 
lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 

30 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD 50 /ED 50 . Compounds which 
exhibit large therapeutic indices are preferred. The data obtained from these cell 
culture assays and animal studies can be used in formulating a range of dosage for use 
in humans. The dosage of such compounds lies preferably within a range of 
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circulating concentrations that include the ED 50 with little or no toxicity. The dosage 
may vary within this range depending upon the dosage form employed and the route 
of administration utilized. 

For any compound identified and used in the method of the invention, the 
5 therapeutically effective dose can be estimated initially from cell culture assays. Such 
information can be used to more accurately determine useful doses in organisms such 
as plants and animals, preferably mammals, and most preferably humans. Levels in 
plasma may be measured, for example, by HPLC or other means appropriate for 
detection of the particular compound. 

1 0 The exact formulation, route of administration and dosage can be chosen by 

the individual physician in view of the patient's condition (see e.g. Fingl et al., in The 
Pharmacological Basis of Therapeutics , 1975, Ch. 1 p.l). 

It should be noted that the attending physician would know how and when to 
terminate, interrupt, or adjust administration due to toxicity, organ dysfunction, or 

1 5 other systemic malady. Conversely, the attending physician would also know to 

adjust treatment to higher levels if the clinical response were not adequate (precluding 
toxicity). The magnitude of an administered dose in the management of the disorder 
of interest will vary with the severity of the condition to be treated and the route of 
administration. The severity of the condition may, for example, be evaluated, in part, 

20 by standard prognostic evaluation methods. Further, the dose and perhaps dose 
frequency, will also vary according to the age, body weight, and response of the 
individual patient. A program comparable to that discussed above also may be used 
in veterinary or phyto medicine. 

Depending on the specific infection target being treated and the method 

25 selected, such agents may be formulated and administered systemically or locally, i.e., 
topically. Techniques for formulation and administration may be found in Alfonso 
and Gennaro (1995). Suitable routes may include , for example, oral, rectal, 
transdermal, vaginal, transmucosal, intestinal, parenteral, intramuscular, 
subcutaneous, or intramedullary injections, as well as intrathecal, intravenous, or 

30 intraperitoneal injections. 

For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
Ringer's solution, or physiological saline buffer. For transmucosal administration, 
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penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Use of pharmaceutically acceptable carriers to formulate identified anti- 
microbials of the present invention into dosages suitable for systemic administration is 
5 within the scope of the invention. With proper choice of carrier and suitable 

manufacturing practice, the compositions of the present invention, in particular those 
formulated as solutions, may be administered parenterally, such as by intravenous 
injection. Appropriate compounds can be formulated readily using pharmaceutically 
acceptable carriers well known in the art into dosages suitable for oral administration. 
1 0 Such carriers enable the compounds of the invention to be formulated as tablets, pills, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by 
rjS a patient to be treated. 

;P Agents intended to be administered intracellularly may be administered using 

techniques well known to those of ordinary skill in the art. For example, such agents 
% 15 may be encapsulated into liposomes, then administered as described above. 
lU Liposomes are spherical lipid bilayers with aqueous interiors. All molecules present 

l& in an aqueous solution at the time of liposome formation are incorporated into the 

2f aqueous interior. The liposomal contents are both protected from the external 

rtf microenvironment and, because liposomes fuse with cell membranes, are efficiently 

20 delivered into the cell cytoplasm. Additionally, due to their hydrophobicity, small 
organic molecules may be directly administered intracellularly. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve the intended purpose. Determination of the effective amounts is well within 
25 the capability of those skilled in the art. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically acceptable carriers comprising excipients and 
auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutically. The preparations formulated for oral 
30 administration may be in the form of tablets, dragees, capsules, or solutions, including 
those formulated for delayed release or only to be released when the pharmaceutical 
reaches the small or large intestine. 



SD-63311.4 



248/037 



The pharmaceutical compositions of the present invention may be 
manufactured in a manner that is itself known, e.g., by means of conventional mixing, 
dissolving, granulating, dragee-making, levitating, emulsifying, encapsulating, 
entrapping or lyophilizing processes. 

5 Pharmaceutical formulations for parenteral administration include aqueous 

solutions of the active anti-microbial compounds in water-soluble form. 
Alternatively, suspensions of the active compounds may be prepared as appropriate 
oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, 

10 or liposomes. Aqueous injection suspensions may contain substances which increase 
the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents 
which increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

1 5 Pharmaceutical preparations for oral use can be obtained by combining the 

active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to 
obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as 
sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such 

20 as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 
disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, or alginic acid or a salt thereof such as sodium alginate. 

25 Dragee cores are provided with suitable coatings. For this purpose, 

concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 
dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for identification 

30 or to characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push- fit 
capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active 
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ingredients in admixture with filler such as lactose, binders such as starches, and/or 
lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, 
such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
5 stabilizers may be added. 

The above methodologies may be employed either actively or prophylactically 
against an infection of interest. 

Computer-related Aspects and Embodiments 

In addition to the provision of compounds as chemical entities, nucleotide 

10 sequences, or fragments thereof at least 95%, preferably at least 97%, more preferably 
at least 99%, and most preferably at least 99.9% identical to phage inhibitor sequences 
can also be provided in a variety of additional media to facilitate various uses. 

Thus, as used in this section, "provided" refers to an article of manufacture, 
rather than an actual nucleic acid molecule, which contains a nucleotide sequence of 

1 5 the present invention; e.g., a nucleotide sequence of an exemplary bacteriophage or a 
sequence encoding a bacterial target or a fragment thereof, preferably a nucleotide 
sequence at least 95%, more preferably at least 99% and most preferably at least 
99.9% identical to such a bacteriophage or bacterial sequence, for example, to a 
polynucleotide of an unsequenced phage listed in Table 1, preferably of bacteriophage 

20 77 (S. aureus host) or bacteriophage 3A {S.aureus host) or bacteriophage 96 (S. 
aureus host). Such an article provides a large portion of the particular bacteriophage 
genome or bacterial gene and parts thereof (e.g., a bacteriophage open reading frame 
(ORF)) in a form which allows a skilled artisan to examine and/or analyze the 
sequence using means not directly applicable to examining the actual genome or gene 

25 or subset thereof as it exists in nature or in purified form as a chemical entity. 

In one application of this aspect, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
readable media" refers to any medium that can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such 

30 as floppy discs, hard disc storage medium, magnetic tape; optical storage media such 
as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories, such as magnetic/optical storage media. A skilled artisan can readily 
appreciate how any of the presently known computer readable mediums can be used 
to create an article of manufacture which includes one or more computer readable 

35 media having recorded thereon a nucleotide sequence or sequences of the present 
invention. Likewise, it will be clear to those of skill how additional computer 
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readable media that may be developed also can be used to create analogous 
manufactures having recorded thereon a nucleotide sequence of the present invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently 
5 known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present 
invention. 

A variety of data storage structures are available to a skilled artisan for 
creating a computer readable medium having recorded thereon a nucleotide sequence 

10 of the present invention. The choice of the data storage structure will generally be 
based on the means chosen to access the stored information. In addition, a variety of 
data processor programs and formats can be used to store the nucleotide sequence 
information of the present invention on computer readable medium. The sequence 
information can, for example, be presented in a word processing test file, formatted in 

15 commercially available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as 
DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of 
data processor structuring formats (e.g., text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence 

20 information of the present invention. 

Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Thus, by 
providing in computer readable form a nucleotide sequence of an unsequenced 
bacteriophage, such as an exemplary bacteriophage listed in Table 1 or of a sequence 

25 encoding a bacterial target or a fragment thereof, preferably a nucleotide sequence at 
least 95%, more preferably at least 99% and most preferably at least 99.9% identical 
to such a bacteriophage or bacterial sequence, for example, to a polynucleotide of 
bacteriophage 77 (S. aureus host) or bacteriophage 3 A (S.aureus host) or 
bacteriophage 96 (S. aureus host), the present invention enables the skilled artisan to 

30 routinely access the provided sequence information for a wide variety of purposes. 

Those skilled in the art understand that software can implement a variety of 
different search or analysis software which implement sequence search and analysis 
algorithms, e.g. 9 the BLAST (Altschul et al., J. Mol. Biol. 215:403410 (1990) and 
BLAZE (Brutlag et al, Comp. Chem 17:203-207 (1993)) search algorithms. For 

35 example, such search algorithms can be implemented on a Sybase system and used to 
identify open reading frames (ORFs) within the bacteriophage genome which contain 
homology to ORFs or proteins from other viruses, e.g, other bacteriophage, and other 
organisms, e.g., the host bacterium. Among the ORFs discussed herein are protein 
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encoding fragments of the bacteriophage genomes which encode bacteria-inhibiting 
proteins or fragments. 

The present invention further provides systems, particularly computer-based 
systems, which contain the sequence information described. Such systems are 
5 designed to identify, among other things, useful fragments of the bacteriophage 
genomes. 

As used herein, "a computer-based system" refers to the hardware, software, 
and data storage media used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware of the computer-based systems of the 

10 present invention comprises a central processing unit (CPU), input device, output 
device, and data storage medium or media. A skilled artisan will readily recognize 
that any of the currently available general purpose computer-based system are suitable 
for use in the present invention, as well as a variety of different specialized or 
dedicated computer-based systems. 

15 As stated above, the computer-based systems of the present invention 

comprise data storage media having stored therein a nucleotide sequence of the 
present invention and the necessary hardware and software for supporting and 
implementing a search and/or analysis program. 

As used herein, "data storage media" refers to memory which can store 

20 nucleotide sequence information of the present invention, or a memory access means 
which can access manufactures having recorded thereon the nucleotide sequence 
information of the present invention. 

As used herein, "search program" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target 

25 structural motif with the sequence information stored within the data storage means. 
Search means are used to identify fragments or regions of the present gnomic 
sequences which match a particular target sequence or target motif. A variety of 
known algorithms are disclosed publicly and a variety of commercially available 
software for conducting search means are and can be used in the computer-based 

30 systems of the present invention. Examples of such software includes, but is not 
limited to, MacPattern (EMBL), BLASTN and BLASTX (NCBIA). A skilled artisan 
can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches and/or sequence analyses can be 
adapted for use in the present computer-based systems. 

35 As used herein in connection with sequence searches and analyses, a "target 

sequence" can be any DNA or amino acid sequence of six or more nucleotides or two 
or more amino acids. A skilled artisan can readily recognize that the longer a target 
sequence is, the less likely a target sequence will be present as a random occurrence in 
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the database. Also, the target sequence length is preferably selected to include 
sequence corresponding to a biologically relevant portion of an encoded product, for 
example a region which is expected to be conserved across a range of source 
organisms. Preferably the sequence length of a target polypeptide sequence is from 5- 
5 100 amino acids, more preferably 7-50 or 7-100 amino acids, and still more preferably 
10-80 or 10-100 amino acids. Preferably the sequence length of a target 
polynucleotide sequence is from 15-300 nucleotide residues, more preferably from 21- 
240 or 21-300, and still more preferably 30-150 or 30-300 nucleotide residues. 
However, it is well recognized that searches for commercially important fragments, 
1 0 such as sequence fragments involved in gene expression and protein processing, may 
be of shorter length. Likewise, it may be desirable to search and/or analyze longer 
sequences. 

As used herein, "a target structural motif/ 1 or "target motif," refers to any 
rationally selected sequence or combination of sequences in which the sequence(s) are 

1 5 chosen based on a three-dimensional configuration which is formed upon the folding 
of the target motif. There are a variety of target motifs known in the art. Protein 
target motifs include, but are not limited to, enzymatic active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to promoter 
sequences, hairpin structures and inducible expression elements (protein binding 

20 sequences). 

A variety of structural formats for the input and output devices can be used to 
input and output the information in the computer-based systems of the present 
invention. A preferred format for an output device ranks fragments of the 
bacteriophage or bacterial sequences possessing varying degrees of homology to the 

25 target sequence or target motif. Such presentation provides a skilled artisan with a 
ranking of sequences which contain various amounts of the target sequence or target 
motif and identifies the degree of homology contained in the identified fragment. 

A variety of comparing methods and/or devices and/or formats can be used to 
compare a target sequence or target motif with the sequence stored in data storage 

30 media to identify sequence fragments of the bacteriophage or bacterium in question. 
One skilled in the art can readily recognize that any one of the publicly available 
homology search programs can be used as the search program for the computer-based 
systems of the present invention. Of course, suitable proprietary systems that may be 
known to those of skill, or later developed, also may be employed in this regard. 

35 Figure 5 provides a block diagram of a computer system illustrative of 

embodiments of this aspect of present invention. The computer system 102 includes a 
processor 106 connected to a bus 104. Also connected to the bus 104 are a main 
memory 108 (preferably implemented as random access memory, RAM) and a variety 
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of secondary storage devices 110, such as a hard drive 112 and a removable medium 
storage device 114. The removable medium storage devicell4 may represent, for 
example, a floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A 
removable storage medium 116 (such as a floppy disk, a compact disk, a magnetic 
tape, etc.) containing control logic and/or data recorded therein may be inserted into 
the removable medium storage device 114. The computer system 102 includes 
appropriate software for reading the control logic and/or the data from the removable 
medium storage device 114, once it is inserted into the removable medium storage 
device 114. 

A nucleotide sequence of the present invention may be stored in a well-known 
manner in the main memory 108, any of the secondary storage devices 110, and/or a 
removable storage medium 116. During execution, software for accessing and 
processing the sequence (such as search tools, comparing tools, etc.) reside in main 
memory 108, in accordance with the requirements and operating parameters of the 
operating system, the hardware system and the software program or programs. 

The data storage medium in which the sequence is embodied and the central 
processor need not be part of a single stand-alone computer, but may be separated so 
long as data transfer can occur. For example, the processor or processors being 
utilized for a search or analysis can be part of one general purpose computer, and the 
data storage medium can be part of a second general purpose computer connected to a 
network, or the data storage medium can be part of a network server. As another 
example the data storage medium can be part of a computer system or network 
accessible over telephone lines or other remote connection method. 
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EXAMPLES 

Example 1: Propagation of Bacteriophage 77 of Staphylococcus aureus 

5 Bacterial propagating strain and Bacteriophage: 

The Staphylococcus aureus propagating strain 77 (PS 77) was used as a host to 
propagate its respective phage 77 (ATCC # 27699-B1). 
Purification of bacteriophage and prepration of phage DNA: 

The propagation method was carried out by using the agar layer method 

10 described by Swanstorm and Adams (Swanstrom, M. and Adams, M.H. (1951). Agar 
layer method for production of high titer phage stocks. Proc. Soc. Exptl Biol & Med. 
78: 372-375). Briefly, the PS 77 strain was grown overnight at 37°C in Nutrient broth 
[NB: 3 g Bacto Beef Extract, 5 g Bacto Peptone per liter, (Difco Laboratories)]. The 
culture was then diluted 20x in NB and incubated at 37°C until the OD 540 = .2. The 

15 suspension (15xl0 7 Bacteria) was then mixed with 15xl0 5 phage particles to give a 
ratio of 100 bacteria/phage particle in the presence of 400 jag/ml of CaCl 2 , After 
incubation of 15 min at room temperature, 7.5 ml of melted soft agar (NB 
supplemented with 0.6% of agar), were added to the mixture and poured onto the 
surface of 100 mm nutrient agar plates (3 g Bacto Beef Extract, 5 g Bacto Peptone and 

20 15 g of Bacto Agar per liter) and incubated overnight at 30°C. To collect the lysate, 20 
ml of NB were added to each plate and the soft agar layer was collected by scrapping 
off with a clean microscope slide and shaken vigorously for 5 min to break up the 
agar. The mixture was then centrifuged for 10 min at 4,000 rpm and the supernatent 
(lysate) is collected and subjected to a treatment with 10 jag /ml of DNase I and 

25 RNase A for 30 min at 37°C. To precipitate the phages particles, 10% (w/v) of PEG 
8000 and 0.5 M of NaCl were added to the lysate and the mixture was incubated on 
ice for 16 h. The phages were recovered by centrifugation at 4,000 rpm for 20 min at 
4°C on a GS-6R table top centrifuge (Beckman) . The pellet was resuspended with 2 
ml of phage buffer (1 mM MgS0 4 , 5 mM MgCl 2 , 80 mM NaCl and .1% Gelatin). The 

30 phage suspension was extracted with 1 volume of chloroform and purified by 
centrifugation using a TLS 55 rotor and the Optima TLX ultracentrifuge (Beckman), 
for 2 h at 28,000 Rpm at 4°C in preformed cesium chloride gradient as described in 
Sambrook et al. (Sambrook, J., Fritsch, E.F. and Maniatis, T (1989). Molecular 
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cloning: A laboratory Manual. Cold Spring Harbor Laboratory, New York. Cold 
Spring Harbor Laboratory Press). Banded phages were collected and ultracentrifuged 
again on an isopycnic cesium chloride gradient at 40,000 rpm for 24 h rpm at 4°C 
using a TLV rotor (Beckman). The phage was dialyzed for 4 h at room temperature 

5 against 4 L of dialysis buffer consisting of 10 mM NaCl, 50 mM Tris-HCl pH 8 and 
10 mM MgCl 2 . Phage DNA was prepared from the phages by adding 20 mM EDTA, 
50 mg/ml Proteinase K and 0.5% SDS and incubating for 1 h at 65°C, followed by 
successive extractions with 1 volume of phenol, 1 volume of phenol-chloroform and 1 
volume of chloroform. The DNA was then dialyzed overnight at 4°C against 4 L of 

10 T.E (10 mM Tris 8 0 , ImM EDTA ). 

Example 2: Preparation of Bacteriophage 77 DNA for Sequencing 
Sonication of DNA: 

15 4 fig of phage DNA was diluted in 200 jal of T.E pH 8.0 in a 1.5 ml Eppendorf 

tube and sonication was performed (550 Sonic Dismembrator, Fisher Scientific). 
Samples were sonicated under an amplitude of 3 |am with bursts of 5 s spaced by 15 s 
cooling in ice/water for 3 to 4 cycles and size-fractioned on 1% agarose gels. 
Fractions ranging from 1 to 2 kbp were isolated and gel purified by using the Qiagen 

20 kit according to the instructions of the manufacturer (Qiagen) and eluted in 50 |txl of 
Tris ImM, pH 8.5. 
Repair of fragmented DNA ends: 

The ends of the sonicated DNA fragments were repaired with a combination of 
T4 DNA polymerase and Klenow as follows. Reactions were performed in a final 

25 volume of 100 \il containing DNA, 10 mM Tris-HCl pH 8.0, 50 mM NaCl, 10 mM 
MgCl 2 , 1 mM DTT, 5 |ig BSA, 100 pM of each dNTP and 15 units of T4 DNA 
polymerase (New England Biolabs) for 20 min at 12°C followed by addition of 12.5 
units of Klenow large fragment (New England Biolabs) for 15 min at room 
temperature. The reaction was stopped by two phenol/chloroform extractions and the 

30 DNA was ethanol precipitated and resuspended in 20 jal of H 2 0. 
Cloning into pKSII and transformation: 
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Blunt-ended DNA fragments were cloned by ligation directly into Hinll (New 
England Biolabs) and calf intestinal phosphatase (New England Biolabs)-treated 
pKSII vector (Stratagene). A typical reaction contained 100 ng of vector, 2 to 5 pi of 
repaired sonicated phage DNA in a final volume of 20 (il containing, 800 units of T4 

5 DNA ligase (New England Biolabs) for overnight at 16 °C. Transformation and 
selection of positive clones was performed in the host strain DH10 (5 of E.coli using 
ampicillin as a selective antibiotic as described in Sambrook et al. (supra) 
Preparation of sequencing templates: 

Recombinant clones were picked from agar plates into 96-well plates. The 

10 presence of foreign insert was confirmed by PCR analysis using T3 and T7 primers. 
PCR amplification of foreign insert was performed in a 15-jal reaction volume 
containing 10 mM Tris (pH 8.3), 50 mM KC1, 1.5 mM MgCl 2 , 0.02% gelatin, 1 \M 
primer, 187.5 jaM each dNTP, and 0.75 units Taq polymerase (BRL). The 
thermocycling parameters were as follows: 2 min initial denaturation at 94°C for 2 

15 min, followed by 20 cycles of 30 sec denaturation at 94°C, 30 sec annealing at 58°C, 
and 2 min extension at 72°C, followed by a single extension step at 72°C for 10 min. 
Clones with insert sizes of 1 to 2 kbp were selected and miniprep DNA of the selected 
clones were prepared using QIAprep spin miniprep kit (Qiagen). 

20 Example 3: DNA Sequencing 

DNA sequencing: 

The ends of each recombinant clone were sequenced on an ABI 377-36 
automated sequencer with two types of chemistry: ABI prism bigdye primer or ABI 

25 prism bigdye terminator cycle sequencing ready reaction kit (Applied Biosystems). 
To ensure co-linearity of the sequence data and the genome, all regions of phage 
genome were sequenced at least once from both directions on two separate clones. In 
areas that this criteria was not met, a sequencing primer was selected and phage DNA 
was used directly as sequencing template employing ABI prism bigdye terminator 

30 cycle sequencing ready reaction kit. 
Sequence contig assembly: 
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Sequence contigs were assembled using Sequencher 3.1 software 
(GeneCodes). To close contig gaps, sequencing primers were selected near the edge 
of the contigs. Phage DNA was used directly as sequencing template employing ABI 
prism bigdye terminator cycle sequencing ready reaction kit. 
5 The sequence obtained for phage 77 is shown in Table 2. The sequences for 

phage 3A and 96 were obtained by similar sequencing methods; the sequences of 
those phage genomes are shown in Tables 7 & 9 respectively. 

Example 4: Sequence Analysis 

10 

Sequence analysis: 

An implementation of the publicly available program SEQUIN, available for 
download at ftp://negi.nlm.nih.gov/sequin/ . was used on phage genome sequence to 
identify all putative ORFs larger than 33 codons. A listing of such ORFs for S. 

15 aureus phage 77 is shown in Table 3, with predicted amino acid sequences for 
selected ORFs shown in Table 4. Listings of ORFs for phage 3 A and 96 are provided 
in Tables 8 and 10 respectively. A variety of other ORF identification could be used 
as alternatives and are known to those skilled in the art. Sequence homology searches 
for each ORF are then carried out using a standard implementation of blast programs. 

20 Downloaded public databases used for sequence analysis include: 
non-redundant GenBank (ftp://ncbi.nlm.nih.gOv/blast/db/nr.Z), 
Swissprot (ftp://ncbi.nlm.nih.gOv/blast/db/swissprot.Z); 
vector (ftp://ncbi.nlm.nih.gOv/blast/db/vector.Z); 
pdbaa databases (ftp://ncbi.nlm.nih.gOv/blast/db/pdbaa.Z); 

25 staphylococcus aureus NCTC 8325 (ftp://ftp.genome.ou.edu/pub/staph/staph-lk.fa); 
streptococcus pyogenes (ftp://ftp.genome.ou.edu/pub/strep/strep-lk.fa); 
streptococcus pneumoniae 

(ftp://ftp.tigr.org/pub/data/s_pneumoniae/gsp.contigs. 1 1 2 1 97 .Z); 
mycobacterium tuberculosis CSU#9 
30 (ftp://ftp.tigr.Org/pub/data/m_tuberculosis/TB_091097.Z); and 

pseudomonas aeruginosa (http://www.genome.washington.edu/pseudo /data.htmlV 

Exemplary results of homology searches are shown in Table 5 for 
bacteriophage 77. 
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Example 5: Identification of Cecropin Signature Motif in Staphylococcus aureus 
Bacteriophage 3A ORF 

The genome for S. aureus bacteriophage 3 A was determined and the sequence 
5 was analyzed essentially as described for bacteriophage 77 in the examples above. 
Upon blast analysis of the identified open reading frames of phage 3 A, the presence of 
an amino acid sequence corresponding to a cecropin signature motif was observed. 
This motif (WDGHKTLEK) is located at position aa 481-489. Cecropins were 
originally identified in proteins from the cecropia moth and are recognized as potent 
10 antibacterial proteins that constitute an important part of the cell-free immunity of 
insects. Cecropins are small proteins (31-39 amino acid residues) that are active 
% against both Gram-positive and Gram-negative bacteria by disrupting the bacterial 

~P membranes. Although the mechanisms by which the cecropons cause cell death are 

JE not fully understood, it is generally thought to involve channel formation and 

In 15 membrane destabilization. 

h! The identification of a motif corresponding to a known inhibitor suggests that 

the product of ORF002 is also an inhibitory compound. Such inhibitory activity can 
I J be confirmed as described herein or by other methods known in the art. Confirmation 

111 of the inhibitory activity would indicate that the ORF product could serve as the basis 

'% 20 for construction of mimetic compounds and other inhibitors directed to the target of 
the ORF002 product. 

Boman & Hultmark, 1987, Ann. Rev. Microbiol 41:103-126. 
Boman, 1991, Cell 65:205-207 . 
Boman et al., 1991, Eur. J. Bioichem. 201:23-31. 
25 Wang et al., J. Biol Chem. 273:27438-27448. 

Example 6: Bacteriophage 77 ORF Expression 

Bacteriophage ORFs are prepared and expressed as generally described in the 
Detailed Description above, utilizing a shuttle expression vector with a locus for 
30 insertion of a phage ORF subject to inducible expression in an appropriate host 
bacterium. 

Preparation of shuttle expression vector: 
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The shuttle vector pT0021, in which the firefly luciferase (lucFF) expression 
is controlled by the ars promoter/operator from a S. aureus plasmid (Tauriainen, S., 
Karp, M., Chang, W and Virta, M. (1997). Recombinant luminescent bacteria for 
measuring bioavailable arsenite and antimonite. Appl. Environ. Microbiol. 63:4456- 
5 4461), was modified as below to suit our specific application. Two oligonucleotides 
corresponding to the influenza HA tag were synthesized. The sense strand HA tag 
sequence (with BamHl, Sail and HindUl cloning sites) is: 

5'-gatcccggtcgaccaagcttTACCCATACGACGTCCCAGACTACGCCAGCTGA-3'; 

the antisense strand HA tag sequence (with HindUl cloning site) is: 

10 5'-agctTCAGCTGGCGTAGTCTGGGACGTCGTATGGGTAaagcttggtcgaccgg-3'. 
The two HA tag oligonucleotides were annealed following a standard protocol {supra) 
and ligated to pT0021 vector that was digested with BamHl and Hindm (the lucFF 
gene was released from the vector and replaced by the HA tag). This modified shuttle 
vector containing the ars promoter, arsR gene and HA tag was named pTHA vector. 

1 5 Cloning of ORFs with a Shine-Dalgarno sequence: 

ORFs with a Shine-Dalgarno sequence were selected for functional analysis of 
bacterial killing. Each ORF, from initiation codon to last codon (excluding the stop 
codon), was PCR amplified from phage genomic DNA. For PCR amplification of 
ORFs, each sense strand primer starts at the initiation codon and is preceded by a 

20 BamHl restriction site and each antisense strand starts at the last codon (excluding the 
stop codon) and is preceded by a Sal I restriction site. PCR product of each ORF was 
gel purified and digested with BamHl and Sail overnight. The digested PCR product 
was then gel purified, ligated into BamHl and Sail digested pTHA vector, and used to 
transform bacterial strain DHlOp. As a result, HA tag is inframe with the ORF and a 

25 fusion protein with ORF begins at N-terminal and HA tag ends at the C-terminal is 
produced. Recombinant ORF clones were picked and their sizes were confirmed by 
PCR analysis using primers flanking the cloning site. The sequence fidelity of cloned 
ORFs was verified by DNA sequencing using the same primers as used for PCR. In 
the cases that the verification of ORFs could not be achieved by one path of 

30 sequencing using primers flanking the cloning site, internal primers were selected and 
used for sequencing. 

Transformation of Staphylococcus aureus with expression constructs 
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Staphylococcus aureus strain RN4220 (Kreiswirth et al., 1983, Nature 
305:709-712) was used as a recipient for the expression of recombinant plasmids. 
Electroporation was performed essentially as previously described (Schenk and 
Laddaga, 1992, FEMS Microbiology Letters 94:133-138). Selection of recombinant 
5 clones was performed on Luria-Broth agar (LB-agar) plates containing 30 |ag/ml of 
Kanamycin. 
Chemical inducers 

Sodium arsenite (NaAs0 2 ), sodium arsenate (Na^AsOJ, and antimony 
potassium tartrate (K(SbO)C 4 H 4 0 6 ) were purchased from Sigma (Sigma-Aldrich 
10 Canada LTD, Oakville) and were used as heavy metals to induce gene expression 
from the ars promoter/operator. 
J Induction of gene expression from the ars operon 

!f: Cells containing different recombinant plasmids were grown overnight at 37°C 

•l~ in LB medium supplemented with 30 jag/ml of Kanamycin. The cells were then 

15 diluted to the mid log phase (OD 540 approx. 0.2) with fresh LB media containing 
[ y Kanamycin and transferred to 96-well microtitration plates (100 (il/well). Inducers 

^ were then added at different final concentrations (ranging from 2.5 to 10 |uM) and the 

i culture was incubated for an additional 2 h at 37°C. Control cultures without inducers 

;^ were cultured in separate wells. The effect of expression of the phage 77 ORFs on 

^0 20 bacterial cell growth was then monitored by measuring the OD540 and comparing the 
rate of growth of the culture containing inducer to the rate of growth of the culture not 
containing inducer. As positive controls for growth inhibition, the kilA gene of phage 
lambda (Reisinger et al., 1993, Virology 193:1033-1036), and the holin/lsinl genes of 
the Staphylococcus aureus phage Twort (Loessner et al., 1998, FEMS Microbiology 
25 Letters 162:265-274) were subcloned into the ars inducible vector and included in 
separate wells of the microtitration plate. 

Expression of ORFs from a large variety of other phage can be accomplished 
using the above vector, or other vector adapted for an appropriate bacterium and 
preferably for inducible expression of the insert ORF or ORFs. 

30 

All patents and publications mentioned in the specification are indicative of 
the levels of skill of those skilled in the art to which the invention pertains. All 
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references cited in this disclosure are incorporated by reference to the same extent as 
if each reference had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as 

5 well as those inherent therein. The specific methods and compositions described 
herein as presently representative of preferred embodiments are exemplary and are not 
intended as limitations on the scope of the invention. Changes therein and other uses 
will occur to those skilled in the art which are encompassed within the spirit of the 
invention are defined by the scope of the claims. 

1 o It will be readily apparent to one skilled in the art that varying substitutions 

and modifications may be made to the invention disclosed herein without departing 
from the scope and spirit of the invention. For example, those skilled in the art will 
recognize that the invention may suitably be practiced using a variety of different 
bacteria, bacteriophage, and sequencing methods within the general descriptions 

15 provided. 

The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising," "consisting essentially of and "consisting of may be replaced with 

20 either of the other two terms. The terms and expressions which have been employed 
are used as terms of description and not of limitation, and there is not intention that in 
the use of such terms and expressions of excluding any equivalents of the features 
shown and described or portions thereof, but it is recognized that various 
modifications are possible within the scope of the invention claimed. Thus, it should 

25 be understood that although the present invention has been specifically disclosed by 
preferred embodiments and optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those skilled in the art, and that such 
modifications and variations are considered to be within the scope of this invention as 
defined by the appended claims. 

30 In addition, where features or aspects of the invention are described in terms of 

Markush groups or other grouping of alternatives, those skilled in the art will 
recognize that the invention is also thereby described in terms of any individual 
member or subgroup of members of the Markush group or other group. For example, 
if there are alternatives A, B, and C, all of the following possibilities are included: A 

35 separately, B separately, C separately, A and B, A and C, B and C, and A and B and 
C. Thus, for example, for the bacteria and phage specified herein, the embodiments 
expressly include any subset or subgroup of those bacteria and/or phage. While each 
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such subset or subgroup could be listed separately, for the sake of brevity, such a 
listing is replaced by the present description. 

Thus, additional embodiments are within the scope of the invention and within 
the following claims. 
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CLAIMS 

What is claimed is: 

5 1. A method for identifying a bacteriophage coding region encoding a product 
active on an essential bacterial target, comprising identifying a nucleic acid sequence 
encoding a gene product which provides a bacteria-inhibiting function when said 
bacteriophage infects a host bacterium, 

wherein said bacteriophage is uncharacterized and said host bacterium is a 

1 0 pathogenic bacterium. 

2. The method of claim 1, further comprising expressing a recombinant 
bacteriophage ORF in cells of a bacterial strain, wherein inhibition of said cells 
following expression of said ORF is indicative that said product is active on an 

1 5 essential bacterial target. 

3. The method of claim 2, wherein inhibition of said bacterium following 
expression of said ORF is determined by comparison with the growth or viability of 
said bacterium following expression of an inactivated mutant form of said ORF or in 

20 the absence of expression of said ORF, and wherein inhibition of said bacterium 
following expression of said ORF is indicative that said product is active on an 
essential bacterial target. 

4. The method of claim 2, wherein expression of said ORF is inducible. 

25 

5. The method of claim 1, further comprising confirming the inhibitor function of 
said ORF. 

6. The method of claim 5, wherein said confirming comprises expressing a loss- 
30 of-function mutant form of said ORF in said host bacterium. 

7. A method for identifying a potential target for antibacterial agents, comprising 
determining the bacterial target of an uncharacterized bacteriophage inhibitor protein: 

35 

8. The method of claim 7, wherein said determining comprises identifying at 
least one bacterial protein which binds to said bacteriophage inhibitor protein or a 
fragment thereof. 

SD-63311.4 
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9. A method for identifying a potential target for antibacterial agents, comprising 
determining the bacterial target of an uncharacterized bacteriophage inhibitor protein. 

5 10. The method of claim 9, wherein said determining comprises identifying at 
least one bacterial protein which binds to said bacteriophage inhibitor protein or a 
fragment thereof. 

11. The method of claim 9, wherein said determining comprises identifying at 
1 0 least one proteimprotein interaction using a genetic screen. 

12. The method of claim 9, wherein said determining further comprises 

0 identifying a bacterial nucleic acid sequence encoding a polypeptide target of said 
%I bacteriophage inhibitor protein. 

II! 15 

13. The method of claim 12, wherein said nucleic acid sequence is identified by 

1 ?! determining at least a portion of the amino acid sequence of a bacterial protein target, 
ft! and identifying a bacterial nucleic acid sequence which encodes said protein target. 

I1J 20 14. The method of claim 9, further comprising identifying a bacteriophage ORF 
y which encodes a product having a bacteria-inhibiting function. 

15. The method of claim 14, wherein said identifying a phage ORF comprises 
expressing at least one bacteriophage ORF in a bacterium, wherein inhibition of said 
25 bacterium following said expression is indicative that said ORF encodes a bacteria- 
inhibiting function. 



16. An isolated, purified, or enriched nucleic acid sequence at least 15 nucleotides 
30 in length, wherein said sequence corresponds to at least a portion of a bacteriophage 

sequence, and wherein said bacteriophage is selected from the group consisting of 
Staphylococcus aureus bacteriophage 77, 3A, and 96. 

17. The nucleic acid sequence of claim 16, wherein said nucleic acid sequence 
35 corresponds to at least a portion of a nucleic acid sequence which encodes a product 

which provides a bacteria-inhibiting function. 
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18. The nucleic acid sequence of claim 16, wherein said nucleic acid sequence is 
transcriptionally linked with regulatory sequences enabling induction of expression of 
said sequence. 



10 



19. An isolated, purified, or enriched polypeptide comprising at least a portion of a 
protein providing a bacteria-inhibiting function, wherein said polypeptide is normally 
encoded by a bacteriophage selected from the group consisting of Staphylococcus 
aureus bacteriophage 77, 3 A, and 96. 

20. The polypeptide of claim 19, wherein said polypeptide provides said bacteria- 
inhibiting function. 



15 21. A recombinant vector comprising a bacteriophage ORF corresponding to an 
ORF from a bacteriophage having a pathogenic bacterial host, wherein said bacterial 
host is selected from the group consisting of uncharacterized bacteria of Table 1. 



20 22. A recombinant cell comprising a vector, wherein said vector comprises an 
ORF from a bacteriophage having a pathogenic bacterial host, wherein said bacterial 
host is selected from the group consisting of bacterial species of Table 1. 



25 23. A method for identifying an antibacterial agent, comprising identifying an 
active portion of a product of a bacteria-inhibiting ORF of a bacteriophage. 

24. The method of claim 23, further comprising constructing a synthetic 
peptidomimetic molecule, wherein the structure of said molecule corresponds to the 
3 0 structure of said active portion. 



25. A method for identifying a compound active on a target of a bacteriophage 
inhibitor protein, comprising the step of 
3 5 contacting a bacterial target protein with a test compound; and 

determining whether said compound binds to or reduces the level of activity of 
said target protein, 
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wherein binding of said compound with said target protein or a reduction of 
the level of activity of said protein is indicative that said compound is active on said 
target and wherein said target is uncharacterized. 



10 



26. A method of screening for potential antibacterial agents, comprising the step 
of determining whether any of a plurality of compounds is active on a target of a 
bacteriophage inhibitor protein, 

wherein said target is naturally produced by a pathogenic bacterium. 



27. A method for inhibiting a bacterium , comprising the step of; 

contacting said bacterium with a compound active on a target of a 
bacteriophage inhibitor protein, wherein said target or the target site is 
1 5 uncharacterized. 



28. A method for treating a bacterial infection in an animal suffering from an 
infection, comprising administering to said animal a therapeutically effective amount 
20 of compound active on a target of a bacteriophage inhibitor protein in a bacterium 
involved in said infection, 

wherein said target is an uncharacterized target or the compound is active at an 
uncharacterized target site. 



29. A method for propylactically treating an animal at risk of an infection, 
comprising administering to said animal a prophylactically effective amount of a 
compound active on a target of a bacteriophage inhibitor protein, 

wherein said target is an uncharacterized target or the site of action of said 
30 compound is an uncharacterized target site. 



30. An antibacterial agent active on a target of a bacteriophage inhibitor protein, 
wherein said target is an uncharacterized target or said agent is active at a phage- 
35 specific site on said target. 
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31. A method of making an antibacterial agent, comprising the steps of: 

a) identifying a target of a bacteriophage inhibitor polypeptide; 

b) screening a plurality of test compounds to identify a compound active on 
said target; and 

5 c) synthesizing said compound in an amount sufficient to provide a 

therapeutic effect when administered to an organism infected by a bacterium naturally 
producing said target. 

10 32. A computer readable device having recorded therein a nucleotide sequence of 
a portion of at least one bacteriophage genome of Staphylococcus aureus 
bacteriophage 77, bacteriophage 3A, or bacteriophage 96, a nucleotide sequence at 
least 95% identical to a said nucleotide sequence, a ribonucleic acid equivalent, a 
degenerate equivalent, a homologous sequence, or at least one amino acid sequence 
1 5 encoded by said nucleotide sequence; and 

a nucleotide sequence or amino acid sequence analysis program, 
wherein said program can perform at least one sequence analysis on said 
nucleotide or amino acid sequence. 

20 

33. A computer-based system for identifying biologically important portions of a 
bacteriophage genome, comprising: 

a) a data storage medium having recorded thereon a nucleotide sequence 
corresponding to a portion of at least one bacteriophage genome, wherein said 

25 bacteriophage genome is uncharacterized; 

b) a set of instructions allowing searching of said sequence to analyze said 
sequence; and 

c) an output device. 

30 34. The system of claim 33, wherein said bacteriophage genome is of a 
bacteriophage selected from the group consisting of uncharacterized bacteriophage 
listed in Table 1. 

35 35. A method for identifying or characterizing a bacteriophage ORF, comprising 
the steps of: 

a) providing a computer-based system for analyzing nucleic acid or 
amino acid sequence data, wherein said system comprises a data storage 
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medium having recorded thereon at least one nucleotide or amino acid 
sequence corresponding to a portion of at least one uncharacterized 
bacteriophage genome, a set of instructions allowing searching of said 
sequence to analyze said sequence; and an output device; 

b) analyzing at least a portion of at least one said sequence; and 

c) outputting results of said analyzing to said output device. 
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ABSTRACT 



A method for identifying suitable targets for antibacterial agents based on 
identifying targets of bacteriophage-encoded proteins is described. Also described are 
compositions useful in the identification methods and in inhibiting bacterial growth, 
and methods for preparing and using such compositions. 
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Table 1 

Phages against human and animal pathogenic bacteria 



Pathogen name 


Phage name 


Catalog# 


Origin/reference 


Acinetobacter 
c<x Icoaceticus 


A3/2 

A36 

B9GP 

B 9 PP 

BS46 

E13 

E14 

531 




Felix d'Herelle Reference 
Centre, Quebec, Quebec 




Ap3 
P78 




J.Bacteriol 1984. 157: 179-183 

J. Gen. Microbiol 1986.132: 2633-2636 


Acinetobacter 
haemolyticus 


2213/73 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 


Acinetobacter 
johnsonii 


133 




Felix d'Herelle Reference 
Centre.Quebec, Quebec 


Acinetobacter sp. 


BP I 




J. Virol. 1 968.2 :7 1 6-722 


(IrA 1TP9 tTP3 St 
vjr*f , xxrZ, ilr J oc 

HP4 




L^an. j .lviicroDioi. i y oo . i z . i uzj - 1 iou oc 
J Virol 1974 13*46-52 & 
Arch. Virol. 1 994 . 1 3 5 : 345-3 54 




Al, A4, A9& 
196 




Arch. Virol. 1 994. 1 3 5:345-354 




HPl 




Can. J.Microbiol. 1 966. 1 2 : 1 023- 1 030 




Al9, A23, A29, 
A31,A33, A34, 
A3759 & 2845 




J.Microsc (Paris) 1973.16:215-224 & 
CR.Hebdo Seances Acad.Sci.Ser D.Sci 
Natur(Paris)278: 1907-1909 & 
Arch. Virol. 1994. 135:345-354 & 
Rev.Can.Biol.l970.29:317-320 


Actinobacillus 


cpAa 




FEMS Microbiol Lett 1994. 1 19:329-337 


actinomycetecomitans 


(pAal7 




Infec, Immun. 1982. 35: 343-349 




Aacp23 




Mol.Gen.Genet 1998.258: 323-325 




Aacp247 




Oral Micriol. Immunol 1997.12: 40-46 


Actinomyces viscosus 


Av-l 


43146-B1 


The American Type Culture Collection 


AV-2,AV-3 
&1281 




Infect.Immun.l985.48:228-233 




BF307 & CT7 




Infectimmun.l988.56:54-59 




phi225 




Plasmid 1997.37:141-153 


Aeromonas hydrophila 


PM2** &PM3 




FEMS MicrobioLLett. 1990.57:277-282 
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Aehl 
Aeh2 
PM4 
PM5 
PM6 
T7-ah 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 


Aeromonas 
salmonicida 


3 

25 
29 
31 
32 

40RR 2 . 8 t 

43 

51 

56 

59.1 

Asp37 




Felix d'Herelle Reference 
Centre, Quebec,Quebec 




55R.1 




Can. J. Microbiol 1983. 29: 1458-1461 


Alteromonas espeji&nci 


PM2** 


27025-B1 


The American Type Culture Collection 


Asticacaulis 
biprosthecum 


Bla 




Felix d'Herelle Reference 
Centre s Quebec,Quebec 


Asticcacaulis 
excentricus 


4>Ac20 
<|)Ac21 
(|)Ac24 


15261-B1 
15261-B2 
15261-B3 


The American Type Culture Collection 


/iZOlUUuLier vlfl&lurlull 


A12 
A14 
A21 
A31 
A41 

IA V 1 


12518-Bl 

12518-B4 
12518-B5 
12518-B9 
12518-BlO 

U I \J J 1J i. 


The American Type Culture Collection 


JiZOlOOuCier •>//. 






Virology 1972.49:439-452 


Bacteroides fragilis 


Bf-1 




Rev. Infect Dis. 1979. 1: 325-336 


rj/IA o 

B40-5 




pHIVIij iviicroDioi. IjClu vyyi* uu. ui*u/ 




HSP40 




Appi. .environ. iviicroDioi, 1707. jj. zvyv- 
2701 




1 * A 1 

pniAl 




Z/eniraiDi.DaKierioi. iy / z.zzz. j i 


Bdellovibrio 
bacteriovorus 


MAC-1 




J. Gen. Microbiol. 1987. 133: 3065-3070 


Bdellovibrio sp. 


VL-1 




T Virol 1973 12-1522-1533 


Bordetella 
brochiseptica 


214 




Zh.Mikrobiol.Epidemiol.Immuno. 1987.5:9- 


Bordetella 
n n Tfinprtu *ts i v 


L-l 

i UXlallld. 




Felix d'Herelle Reference 
Centre,Quebec ? Quebec 


phiT 




Mol. Gen. Mikrobiol. Virusol. 1988.4: 22-25 




134 




Zh.Mikrobiol.Epidemiol.Immuno. 1987.5:9- 
13 




41405 




ZhMila-obiol.Epidemiol.Immuno. 1987.5:9- 
1 13 



SD-63311.4 
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Brucella abortus 


S708 

Lit <Jt u 




Felix d'Herelle Reference 
Centre 3 Quebec,Quebec 




Tbilisi 

1 f~\ IT 

10/1 

0/1 /TT 
Z4/11 

010 / YV 
Z 1 Li A V 

J / 11 AA1A 


23448-Bl 

OlyMO "DO 

Zj44o-I3J 
1 /jOJ-DI 

17385-B2 


The American Type Culture Collection 




I5JS.-Z, ID oc 
Pi** 




7h IVTilrrrVhinl PniHpminl TmnrninrVhiol 1981 2* 

48-52 








Dpv Ri'ol Stand 1984 56*55-62 


Brucella corns 


R/c 




Dev. Biol. Stand. 1984.56: 55-62 


Brucella melitensis 


BK-2 


23456-B1 


The American Type Culture Collection 


Brucella suis 


Wb 




Zentxalbl. Vetennarmed. 1 975 .22 : 500-5 67 




Fi** & TB 




Zh.Mikrobiol.Epidemiol.ImmunobioL1983,2: 
48-52 


Brucella sp. 


Np (Nepean) & 
Iz 




Can. J. Vet. Res. 1989.53: 319-325 




Iz-1 




Res. Vet. Sci. 1988. 44: 45-49 




R 




Zh.Mikrtobiol.Epideixuol.Immunobiol. 1 983 .2: 
48 


Campylobacter coli 

Campylobacter coli 
(Cont'd) 


17 
18 
19 

20 


43133- Bl 

43134- Bl 

43135- Bl 

43136- Bl 


The American Type Culture Collection 
The American Type Culture Collection 


Campylobacter jejuni 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

1 ^ 

14 
17 

1 o 

18 
19 
20 


35918- Bl 

35919- Bl 

35920- Bl 

35921- Bl 
35918-B2 
35920-B2 

35922- B2 

35923- Bl 

35924- Bl 

35925- Bl 
35925-B2 
35922-B2 
35924-B2 
35922-B3 
411 

43135-Bl 

nfi "R1 

4 j 1 -50-151 


The American Type Culture Collection 


Campylobacter 
(Helicobacter) pylori 


HP1 




J. Med. Microbiol. 1993. 38: 245-249 


Chlamydia psittaci 


Chpl** 




J. Gen. Virol. 1989. 70: 3381-3390 


Clostridium 
acetobutylicum 


CAK-1 




J.Bacteriol.l993.175:3838-3843 


Clostridium botulinum 


C** & D** 




Nucleic Acids Res.l990.18:1291 




C-ST** 




Bioch.Biophys.res.Commun. 1 990. 1 7 1 . 1 304- 
1311 


SD-63311.4 


al &a2 




Microbiol.immunol. 1981.25:91 5-927 
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d-l6phi**, d-l\ 




J. Vet.Med.Sci. 1992.54:675-684 




/"II ' O O /"^T^ 

CE p &CE y 






Clostridium difficile 


A 1 JPr 

41 oc DO 




J. Micro D101. IVoj.ZI .ZD1-ZD4 


Clostridium 


PF1,PF2, PF3 & 




Rev.Can.BioL1977.36:205-215 


perjringens 


PF4 








q>29** & (p59 




rEMS Microbiol.Lett 1990.54:323-326 


{^lOSTrlulUJfl 


T 
J 


Qf\lA Til 
oU / 4-151 


The American Type Culture Collection 


sporogenes 




1 / 000-13 1 






70 


17886-B3 






71 


17886-B4 






72S 


17886-B5 






70 T 

/ZLf 


17886-B6 




— ; — z. : 

Clostridium tetuni 


AOCD 




Kev.can.rsioi.iy /0.3 /:4J-4o 


Covyti ebdcterium 


T~>T7 in J, rtO/l 

rsr, <py oc <p b>o4 




vopr. vrrusoi. iyoo.3 1 . j / /Oo4 


diphteriae 






LsOryn suuc i&rx urn 


IN IN 


1 01 1 Q "R1 

IZo iy-r>i 


i ne American 1 ype uuirure i^onecuon 


Tjspudotuhpvculo V7 V 








Corynebacterium sp 


DLC 2921/49 


12052-Bl 


The American Type Culture Collection 


Enterococcus faecalis 


42 


19948-B1 


The American Type Culture Collection 


Enterococcus faecium 


113 


19950-B1 


The American Type Culture Collection 




124 


19953-b2 






133 


19953-B1 





SD-63311.4 
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AP211 


11303-B14 


The American Type Culture Collection 




BG3 


11303-B10 






C33 


11303-B21 






C36 


8677-B1 






C204 


11303-B13 






El 


13706-B4 




Escherichia coli 


q*# 


15766-B1 


The American Type Culture Collection 


(Cont'd) 


f2** 


15766-B1 






FCZ 


1242-B5 






fd** 


15669-B2 






ft** 


15767-B1 






G178 


11303-B16 






Ifl** 


27-65-B1 






If2 


25065-B2 






iVl 1 _> 


15669-B1 






MS2** 


15597-B1 






MU9 


21816-B1 






Mu-1 


23724-B9 






0x6 


15593-B1 






PI** 


25404-B1 






P4 sidi ** 


29746-B1 






0-6** 

V F 


23631-B1 






R17** 


25868-B1 






Z1K/1 


25298-B1 






ZJ/2 


25298-B2 






rA105 


11303-B37 






rEDa41 


11303-B24 








11303-B26 








11303-B27 








11303-B28 








11303-B29 








11303-B30 






rH88 


11303-B33 






rJ3 


11303-B31 






r71 


11303-B25 






Tlo / 


11303-B35 






riyo 


11303-B34 






IDjO 


11303-B36 






ri joy 


11303-B32 






S13** 


13706-B5 






A l 


11303-B1 






l 2 


U303-B2 






T 


11303-B3 






T ** 

A 4 


11303-B4 






T amA4S3 


35060-B1 






T amftl7 


35060-B2 






T ntviMI 


35060-B3 






1 -J 


11303-B5 






T6** 


11303-B6 








11303-B7 






T 7 M 


11303-B38 




SD-63311.4 


5 


12141-B1 








12144-B3 
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Escherichia coli 
(Cont'd) 



SD-63311.4 



250 
547 
UV1 
UV47 
UV375 
a3** 
X ** 
XC-17 
X sus P-3 
X sus R-5 
X sus J-6 
X sus 0-8 
Xsus A-ll 
X ind" 
092 

*z*V-l 

^X174** 

^Xcs70am-3 



G4** & iK** 



BF23 5 * 



Mul 



Hpl7 



K3** & 0x2** 



Rbl8**,Rb51 & 
Rb69** 



HI**, H3, H8, 
K9, 

K18 & Oxl 



Ml** Tula** & 
Tulb** 



K10 



Qsr' 



B278 



phi 80* 



phi m!73 



tf-1 



P4&phiR73 



PRDl 



K3hx 



933J**& 
933 W** 



H19-B** 



Tcp-111 



N4* 



Phi 80 trp 



Obeta 1 



11303-B20 

11303-B17 

11303-B15 

11303-B11 

11303-B18 

13706-B2 

23724-B2 

23724-B1 

23724-B3 

23724-B4 

23724-B5 

23724-B6 

23724-B7 

23724-B8 

35860-B1 

13706-B3 

15597-B2 

13706-B1 

49696-B1 



The American Type Culture Collection 



Biochim,Biophysica Acta. 1992.11 3 0:277-288 



J.Bacteriol 1 977. 1 29:265-275 



J.Ultrastruct,Res.l966.14:441-448 



J.Mol.Biol 1991.21 8:705-72 1 



FEBS Lett.1987.21S: 145-150 



J.Bacteriol. 1990. 172: 1 80-1 86 



MoL Gen.Genet. 1 990.22 1 :49 1 -494 



J.MoLBiol 1 987. 1 96: 1 65- 1 74 



J.Bacteriol. 1979. 140:680-686 



J.Bacteriol. 1985. 1 62:256-262 



J.Gen.Microbiol.l988.134:1333-1338 



FEMS Microbiol.Lett. 1 994. 1 19:71-76 



Genetika 1985.21:673-675 



J.Gen.Microbiol.l987.133:953-960 



MoLMicrobiol. 1995.18:201 -208 



J.Gen.Microbiol.l982.128:2797-2804 



Virology 1990.177:445-451 



Mol.Gen.Genet.l987.206:l 10-1 15 



Infect.Immunity . 1 986.53 : 1 35- 1 40 



J.Bacteriol 1987. 169:4308-43 12 



ZentralbnLBakteriol.MikrobioLHyg. 1 988.270 
:41-51 



VetMicrobiol. 1 992.30:203-2 1 2 



AnninstPasteur. 1971.120:121-125 



J.Bacteriol. 1 978. 1 33 : 1 72- 1 77 
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P1CM 




J.Gen.Microbiol.1978. 107:73-83 




PA-2** 




J.Bacteriol. 1 990. 1 72 : 1 660- 1 662 




186** 




Mol.Gen.Genet 1982. 187:87-95 




186.IX.B 




MoLMicrobiol. 1992.6:2629-2642 




21** 




Virology 1983.129:484-489 




P4** 




MicrobiolRev.l993.57:683-702 




82** 




J.Biol.Chem. 1 987.262: 1 1 72 1 - 1 1 725 




PSP3 




J.Bacteriol.1996. 178:5668-5675 




HK022** 




Nucleic Acids Res. 1994.22:354-356 




D108** 




Nucleic Acids Res.l986.14:3813-3825 


Escherichia coli 


Rb49 




J.Mol.Biol. 1 997.267:237-249 


(Cont'd) 


Ike** 




JMol Biol 1985 1 81 -97-^0 




P22dis 




Mnl Gen Cr(*r\pf 107R 1^^«9^^ 




N15** 




TT^arterinl 1006 175MA8J. 








Prnr TZ *\nr T o-nrl "R "Rinl 1QQ1 O^^-O'J ^fi 
Jr i UL.JV.ou^.i^onu.i3, 15101. oCl. iyy 1 .Z4D.Z j-jU 




Stx2Phi-I & 








Sfx2Phi-II 








18 




Virology 1987.156:122-126 




X 




J Gen Mirrohiol 1QR1 1?fi-^SQ ^QA 




AC3 




Mnl Mirrnhirtl 1001 S-71^ 77^ 




BW-1 




reiix a rierene Keierence 




C-l 




Centre 3 Quebec, Quebec 




E920g 








Esc-7-11 








H19J 








Haiti 








HK243 








la 








K20 








K30 








KL 3 








M 








Mu** 








O103 








0157:H7 








P1D 








ptl 








PilHa 








PR64FS 








PR772 








SS4 








P4Q 








^vir** 








Q8 








09-1 








92 






Haemophilus 


HP1** 




Nucleic Acids Res. 1996.24:2360-2368 


influenzae 


S2** 




Gene 1997. 196: 139-144 



SD-63311.4 
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Halobacterium 
cutirubrum 


S45 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 


Halobacterium 
halobium 


$H&<|>N 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 




TTL1 O TTL") 

Hhl & Hn3 




Can, J .Microbiol. 1982.28:91 6-92 1 


Halobacterium 
salinarium 


PhiH 




Biol.Chem.Hoppe Seyler 1994.375:747-757 


Klebsiella oxytoca 


i 

ti-l 




J.uen.Microbioi.iyo /. lii:y!)i-yoU 


Klebsiella pneumoniae 


OVJ 
00 


23357-B1 


The American Type Culture Collection 




K19Q 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 




FC3-1 & FC3-9 




Can J.Microbiol. 1 99 1 .3 7:270-275 




FC3-10 




FEMS Microbiol.Lett 1 99 1 .67:29 1-297 


Klebsiella sp. 






ivioi.ijen.vjenet. lyyu.zzi .Zoj-Zoo 


Leptospira sp. 






Kes.Microuioi. iyyu. 141.1 131-1 1 jo 


Listeria 


243 


23074-B1 


The American Type Culture Collection 


monocytogenes 


197,1313 & 
9425 




Appl.Environ.MicrobioL1997.63:3374-3377 




H387 & H387-A 




Appl.Environ.Microbiol. 1993.59:29 14-29 1 7 




5775,6223 
&12682 




APMIS.1993.101:160-167 




2389, 2671, 
4211 & 2685 




Intervirology 1994.37:31-35 & 

ZentralbLBakteriol.MikrobioLHyg. 1 986.261 : 
12-28 




4b,4ab,4g &3c 




Ann.Microbiol (Paris) 1977.128:185-198 




A118,A500& 
A511** 




Mol.Microbiol 1995.16:1231-1241-992 




1,3,4, 5, 6, 7, 8, 
9, 10, 11, 14, 15, 
16, 17, 19 & 20 




Ann.Microbiol. (Paris) 1979.130B:179-189 




l/2a, l/2b, 3c, 
4ab, 6a & 6b 




Clin.Invest.Med. 1984.7:229-232 




<|>LMUP35 
2685 




Felix d'Herelle Reference 
Centre,Quebec,Quebec 


Listeria innocua 


4211 




Felix d'Herelle Reference 


Micrococcus luteus 


Nl 
N3 
N4 
N8 


4698-Bl 
4698-B4 
4698-2 
4698-B3 


The American Type Culture Collection 


Micrococcus luteus 


N17 




Can. J.Microbiol. 1 979.25 : 1 027- 1 035 



SD-6331L4 
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Mycobacterium 
smegmatis 


BK-3 
Bol** 
Bo 6 
Bo 611 

Tin ATTT 
tJO 0111 

Mc-2 
Mc-4 
NN 

Phagus lacticola 
Rl 


27203- B1 

27204- B1 

27205- B1 
27205-B2 

Z fZ\JD-DD 

607-B6 

607-B7 

11727-B1 

11759-B1 

607-B1 


The American Type Culture Collection 


if! 




33D 
BKl 

ciarK 

TYMA TTT 
L/rN/\ ill 

Leo 
Roy 
Sedge 


HER 3 17 
HER 330 

TTT7T> 
HJiK DDD 

liJ_-iV DDD 

HER 33 1 
HER 316 
HER 332 


Felix d'Herelle Refrence 
Centre,Quebec,Quebec 










Mol.MicrobioL 1 993 .7 :395-405 






D29 




J.Mol.BioL1998.279:143-164 






LI 




Proc.NatLAcad.Sci USA. 1988.84:2833-2837 


I U 




B 




Mol.Biol.Rep. 1981.30:11-15 


I $1 




TM4 




Proc.Natl. Acad. Sci.USA 1 997.94: 1 096 1 - 

1 hq/^a 
luyoo 


s 

.watt 




29M, 31M, 122, 
154, 37, 29D, 46, 
139,110,141, 
74D, AGl & 
DS6A 




Arch. Virol.1993. 133:39-49 & 
Am.Rev.Respir.Dis. 1975. 1 12: 17-22 


. 4 s : i: 


Mycobacterium 
fortuitum 


NN 
Bo 4 
Bo 7 


23052-B1 
27207-B1 
27207-B2 


The American Type Culture Collection 




Mycobacterium leprae 


BKl, Clark, 
Sedge, Baits, 
Watson & D29 




AmMicrobiol. (Paris) 1982,133:93-97 




Mycobacterium 
i uuercutob is 


LG 

DS6A 

D-34 


25618-B1 

ZjOIo-dZ 

4243-B1 


The American Type Culture Collection 






110, 139&33D 




Arch.ViroL1993.133:39-49 






AG1,GS4E, 
BG1, 

PH & BKl 




The Biology of Mycobacteria. Academic 
Press Joronto 1982 (Ratledge & Stanford) 
1982.309-351 




Mycobacterium sp 


Phagus pellegrini 

NN 

Bl 


11760- B1 

11761- B1 

23239-B1 


The American Type Collection Culture 






TM4, ph60, 

ph72, 

PhAE39, 

phAE40 

&Bxbl 




Microbiology 1995.141:1173-1181 




SD-633U.4 


C2 




Experentia 1969.25:1112-1113 
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18 & 115 




J Gen Virol 1987 68*949-956 


63 




Gruzlica 1968 36*617-622 


phlei & 
butyricum 




T Gen Virnl 1Q75 90*7^5 7^R 


MyF3P-59a 




Z All? Mikrnhinl 1 968 8 '79-^7 


Bo2a 




J.Gen. ViroL1973.20:75-87 


D4,D28&D32 




J.Exptl.Med. 1966. 123:327-340 


HC 




IBacteriol. 1963.86:608-609 


Mycobacterium 
vaccae 


B5 


15483-B1 


The American Type Culture Collection 


Mycobacterium phlei 


NN 
NN 

IN IN 

Bo 2 
Bo2h 

DO J 


11728-Bl 

1 1 75s ri 

27086-Bl 
z/zUo Bl 


The American Type Culture Collection 


iviycoptasma 
arthritidis 


MA V I** 




Infect.Immunity. 1 995.63 :40 1 6-4023 


Mycoplasma hyorhinis 


Hr-1 




Arch Virol 198^ 77*8 1-85 

V 11 XJl, 1 70J. / / .O l"OJ 


Mycoplasma 
pneumoniae 


Br-1 




Arrh Virnl 108^ 75*1 15 

iVl^ll. V llUi. 1 70J, / J. 1-1 J 


Mycoplasma pulmonis 


PI 




Plasmid 1995. 33: 41-49 


Mycoplasma sp. 


MV-01 




J.Gen.Microbiol. 1985: 13 1:3117-3126 


LI 




J. Virol.l986.59:584-590 


L2** 




Gene 1994. 141: 1-8 


L3 fMV-L3) 




MiVrnVnnc 1 QQO 111 17^ 


MAV-1 




Tnfprtirvn/Cr Trnmiinihr 100^ £1- /101£ /IA.T2 

iiiicciionoc iinmuniry iyyj, oj. 4U10-4uZj 


20-P 




M>H Rinl 1087 £0*1 1 A 170. 


MV-L2 & 

T lift 

MV-lg-pS2- 
L172 




Arrri Virnl 1 07Q £1*780 7GA 


MV-lg-L 172 




Acta. Virol 1978.22:443-450 


BN1 




J.Gen. Virol. 1 979.42:3 1 5-322 


MVL51 




Virology 1973.55:118-126 


MVLl, 
MVL52 & 
MVG51 




Science 1971.173:725-727 


Neisseria perflava 


NP-1 




J.Clin.MicrobioL1976. 4:87-91 


Nocardia erythrypolis 


q>C 




J.Gen. ViroL1974.23:247-254 


q>EC 




TRantprinl 1 076 17A-11AA 1 1 07 
J , JI>aL LCI 1U1. 1 y f\J. 1ZO. 1 J.LH—1 lU / 


Pasteurella multocida 


B225 




Arch.Exp.Veterinarmed.l981.35:433-436 


B939a 




AmJ.Vet.Res. 1978.39:1565-1566 




Nos.115, 32, 967 
& 

1075 




Vet.Med.Nauki. 1977.14:33-36 


Propionibacterium 
acnes 


NN 


29399-B1 


The American Type Collection Culture 



SD-63311.4 
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Pseudomonas 


1 


1217S-R1 


i iie American i ype LAiiture collection 


aeruginosa 


2 






2A 








2B 


12175-B4 






11 


14205-B1 






16 


14206-B1 






24 


14207-B1 






27 


14208-B1 






44 


14209-B1 






73 


14210-B1 






95 


1421 1-B1 






109 


14212-B1 






113 


14213-B1 






249 


14214-B1 






B3 


15692-B1 






Hoff 2 

Hull £t 


14203-B1 






Hoff 3 


14204-B1 






Pa 


12055-B1 






Pb 


12055-B2 






PB-1 


15692-B3 






Pc 


12055-B3 






Pf 


25102-B1 






PP7** 


15692-B2 






SD1-M, $wl4, 




Felix d'Herelle Reference 




7&31 




Centre,Quebec,Quebec 




Pf3** 




J.Virol.l983.47:221-223 




cp-MC 




Can. J.Microbiol. 1 969 . 1 5 : 1 1 79- 1 1 86 



Pfl* 



PR4* 



A7 



KF1 



<€TX* 



f2** 



J.Mol.Biol. 1991.218 :349-364 



J.Gen. Virol. 1 979.43 :583-592 



J.Bacteriol. 1 992. 1 74 :2407-24 1 1 



J.Biochem. 1983.93:61-71 



MoLMicrobiol. 1 993 .4 : 1 703- 1 709 



J.Virol. 1 977.24: 135-141 



SD-63311.4 
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(pi^Z,, ZI, (pJNZ, 




Arch. Virol. 1 993 ,131:141-151 




PMN17, PTB80, 








68, PB-1,E79, 








16, 








109,352, 1214, 








F8,71,337,M4, 








(pC17 SL2 B17 








Li-24 (f>rrmP7& 








PS17**, cpl,73, 








Mo, Li-2, 7, 








/■(V*wnT7C / ) 

cpmiiroZ, 








PTB2, PTB20, 








PTB42, cpKF77, 








31,PTB21, 








119x, 








<pPLS27, B3, 








258, 








Hwl2, PM57, 








PM62, PM105, 








148, PM681, 








198, 








218, 222, 242, 








246, 








PC131,cpCll, 








JSL5, 
























riVloy, rMU, 








x iviu i , .rivi 1 1 jj 








(p240, 249 & 269 






Pseudomonas 


297,309,318, 




Arch.ViroL1993.131:141-151 


aeruginosa 


11, 






(Cont'd) 


PH51.342, 351, 








PH93, 357-1, 13, 








14, PCI 1-1, 267, 








D3**,PC351, 








KF, PM63, 








PH132, , 1°, cpX, 








400-1, 45, SM, 








SL3, SL1, tpll, 








F10, <pC15, 160, 








20,336, 350, 








(pC5, cpCll-l, 








(pC13, 295, ,SL4, 








G101,F116, 








B26, (pBS, 53, 








145, 284 & 308 






Pseudomonas cepacia 


42 & 83-24 
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Pseudomonas fragi 


psl 

wy 


27362-Bl 
27363 Bl 


The American Type Culture Collection 


Pseudomonas 
phaseolicola 


(f»6 




Felix d'Herelle Reference 
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Pseudomonas putida 


gh-1 


12633-B1 


The American Type Culture Collection 
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^6 


21781-B1 


inc /\mencan lype i^unure collection 


Pseudomonas sp. 


PPs-G3 


49780-B1 


The American Type Culture Collection 


Salmonella bareilly 


Sab 2 
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Salmonella enteritidis 


1,2,3&6 




EpidemioLInfect 1995. 1 14:227-236 




2a, 3a, 4a, 5a 5 6a, 
7a, 8a, 9a, 15, 

1 Q OA 2t1\ ** 
17, £A) CCLl. 




VetM eANauki. 1 975 . 1 2: 5 5-60 


Salmonella newington 


Epsilon 34 




J.StructBiol. 1995.115:283-289 


oaimoneua newpori 


7 11 
/-ll 

16-19 


2 /ooy-r>l 

27869-B2 


Ine American lype Culture Collection 




2,5a 
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Salmonella paratyphi 


31 

Paratyphoid A 


19940-B1 
12176-B1 


The American Type Culture Collection 




Jersey 
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Salmonella 
senftenberg 


SasLl, SaL2, Sal 

oai/i-, oaJLj oi 
SasL6 




Indian J.Med.Res. 1997.105:47-52 


Salmonella 
typhimurium 


P22** 
SL-1 


19585-B1 
40282 


The American Tvne Ciilnirp C* nllprtinn 




MB78** 




J. Virol. 1982.41: 1038-1043 




SE1 




J.Gen.Microbiol 1 986. 1 32: 1 03 5- 1 04 1 




LT2 




Virology 1971 45-835-636 




ES18** 




Virology 1970.42:621-632 




L** 




J. Virol. 1985.56:1 034- 1036 




P1CM clr-100 




Mol.Gen.Genet. 1975,138:113-126 




F22 




GenetRes. 1 986.48 : 1 39- 1 43 




Fels 1 




J.Gen.Virol.l978.38:263-272 




Fels2 




GenetRes 1986 48*139-143 




Px 




Mol. Gen. Genet. 1 970. 108:1 84-202 




Pike 




Viroloav 1974 60*503-514 




A3&A4 




J.Bacteriol. 1987.169:1003-1009 




UT 
III 




uenetKes. 1 976.27:3 1 5-322 


iV/7 ITYI n~H Otl fl 


TT? A 




j.rsasic MicroDioi. lyyu.oi): /u /- / lo 


typhimurium 


1V1UQ 1 




Mol.Uen.uenet iyoo.2Uz:3z7-3J0 


(Cont'd) 


1 &cir6-l) 




Moi.Lren.vaenet. I y 54. 1 y o : 1 U:>- 1 09 




BF23** 




Mol.Gen.Genet 1976 147-195-202 




Kbl 




J.Bacteriol. 1 974. 1 1 7:907-908 




P221dis 




J.Gen.Virol.l978.41:367-376 




PRD1** 




Virology 1990.177:445-451 




I 5 -2** 




J.Gen.Microbiol.l982.128:2797-2804 




tf-1 




J.Gen.Microbiol.l987.133:953-960 








J.Gen.Microbiol. 198 1 .126:389-396 
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Salmonella 


8 


19937-B1 


The American Type Culture Collection 




typhosa/typhi 


23 
25 

AC 

46 

53 

163 

175 

Vil 

ViVI 


19938- Bl 

19939- Bl 

1 flO/IO "D 1 

iyy4z-Bi 
19943-Bl 

1 C\C\A £. TJ 1 

19947-Bl 
27870-Bl 

9707A r>9 
Z / 0 / U-I5Z 








Ol 
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\ 7 ITT 




Chung Hua Liu Hsing Ping 

TT T P 1QQ9 13*988 










Tr TP « Mirrrhhinl 1QR1 19Q*HQ5-H400 




Salmonella sp. 


P3 

HQ- 


25957-Bl 
25957-B2 

Zjyj /-r>j 


The American Type Culture Collection 






jpyc 


ZjyD /-rS4 








Pin 

r iU 










Phi 

R34 


1 QQ45-R1 

9849-R1 

97541 




*5 




MG40 




Virnlnav 1Q6£ 34*591-530 


as? s 




P14 




Microb.Pathog. 1990.8:393-402 






PSP3 




Virology 1992.188:414 






IKC 




7f»«trnTh1 RnVtprinl 1 Q7£ 934-904 304 






P97 JPr QTsJA 




TViVftl 1QS£ 19-091 031 

j. viroi. iyoo.iz.yzi-yj 1 


?1] - 


Sphaewtilus natans 


cvri 
oiN 1 




Appi.iinvu^on.iviicroDioi. iy /y.j /. iuzd-ilou 


^| 


ufiig&iici uyoeriienue 


9 

z 


LOOO 1-13 1 


ine /vruencan lype wuiiure \^oiiecuon 






P2 


1 1456b 
11456a-Bl 








D20 


12661-B1 


Tli** Aiyiptipati Tvt^p Pnltiiff* PfVMppHrvn 






Sfll** 




Mol Microbiol 1 997 96-939-950 






QfV** 

Ol V 




Gtw 1QQ7 99-917-997 






OJ.U 




Mnl MirrnWnl 1005 1R-901 90S 






OLJ\, 




ri^rip 1QQ1 190*00 101 

oene iyyj.izy.yy-iui 




driigeiiu sonnei 


PI A** 










TTfn 




AArA RiVkl rM>«iV* 1077 1 1 -191 HI 




Shigella sp 


37 


23354-B1 


The American Type Culture Collection 




Spiroplasma citri 


SpVl 




Plasmid 1993.29:193-205 




Spiroplasma sp. 


SpVl-R8A2B 




Nucleic Acids Res. 1990.18:1293 






SpV3 




IsrJ.Med.Sci. 1987.23:429-433 






SpV4 




J.Bacteriol.l987.169:4950-4961 




Staphylococcus albus 


l to 18, 20, 21 to 
25, 27, 29 to 36 
&39 




Staphylococci & Staphylococcal 
Infections. 1997. 
Voll:503-508 (Karger,Basel) 
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Staphylococcus aureus 


1 A 


Z / /UZ-151 


ine American iype L^uinire L^oiiecuon 






27703-B1 






O 


27704-B1 








23360-B1 

~J \J\J XJ X 






1 7 


23361-B1 

£*^J *J \J X XJ X 






7Q 

zy 


27705-B1 






4ZJJ 


27712-B1 






4Zii 


27690-B1 

^- / W \J XJ X 






4 / 


2769 1-B1 

I \f *f X X^ X 






OZ 


27692-B1 






jZA 


27693-B1 

1 \JJ^J XJ X 








27694-B1 

/ \J Si XJ X 






D4 


27695-B1 

4* i \j y *j xj x 






rr 


27696-B1 






/l 


27697-R1 

^> / \jy i xj l 






7^ 

fj 


27698-B1 






77 


27699-B1 

/ \J ~s S XJ X 






7Q 


27693-B2 






oU 


27700-B1 

f f W XJ X 






C 1 


2770 1-B1 

£* f f \J X XJ x 






5JA 


27706-B1 

w f f w v X-/ X 






O A 

84 


97707-R1 






85** 


27708-R1 

£* l l \JO XJ x 






QQ 

oo 


33742 






yz 


33741-B1 








15565 






K 


19685-B1 






PI 


11987-B1 






P14 


11988-B1 








15752-B1 






44AHTD 


HFR 101 


X t'llA. LI llvlvllv XVC-ltl v-liWC 




187 


HER 239 
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2638A/2854 


HER 283 






p68 


HER 49 






Twort** 


HER 48 






<|)ii** 




IBacteriol. 1988.1 70:2409-24 1 1 




<bl3** &<b42** 




J.Gen. .Microbiol. 1989.135:1 679- 1 697 








TRrtprirtl 1 Q8.fi 1fifi-^SS ^01 








L^an. j .ivxicroDioi. i yy 0.4 j .0 1 Z-0 1 0 




y4,yj & yo 




J .Ciin.MicroDioi. l y oo.zo:zjyD-z4U 1 




<pl31,A 3 &A 5 




Staphylococci & Staphylococcal 








Iniections.iyy /. 








Voll:503-508 (Karger,Basel) 




Phi PVL** 




Gene 1998.215:57-67 


olupnyLOCOCCUS 


l3ao 1 i^Z 




renx u riereiie iveierence 


carnosus 
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Staphylococcus 


la, 2b, 3a, 4b, 




Can. J.Microbiol 1988.34:1358-1361 


epidermidis 


5a, 6b, 7b, 8c, 








9a, 10a, llb,12a 








& 13b 






SD-63311.4 









91 



248/037 





41,63, 1 1 811, 
138, 

245, 336, 392 & 
550 




Res.Virol.l994.145:l 11-121 


Staphylococcus 
saprophyticus 


1154A, 1405, 
1314, 1139 & 
1259 




Res.Virol.1990.141: 625-635 & 
Res.Virol.1994. 145: 11 1-121 


Staphylococcus sp. 


Phi 812, Phi 131, 
SK311 &U16 




Virology 1998.246:241-252 


Streptococcus faecalis 


VD13 


HER44 
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Streptococcus faecium 


PE1 




ZentralbLBakteriol. 1 975.23 1 :42 1 -425 


Streptococcus oralis 


Cp-l**&Cp- 




FEMS Microbiol.Lett.l989.65:187-192 


Streptococcus 
pneumoniae 


Cp-l** 


HER223 


Felix d'Herelle Reference 
Centre,Quebec,Quebec 


Cp-1**, Cp-5**, 
Cd-7** Cd-9** 

CD-I & (0-2 




J.Vrrol.198 1.40:55 1-559 & 

tint* T "Dirt^Vna-rrt 1 QHQ 1H1^0 Sr 

rirurj.rsiocnern. iy /y.iui.jy-04 cc 

IVIlcrOulal i-TUg JXCSlSlallCc lyy I . D . L O J- I/O 




ild-ozj o6 lid- 

746 








EJ-1** 




J.Bacteriol.l992.174:5516-5525 




Dp-2 & Dp-4 




J.Virol. 1978.26:22 1-225 




Dp-1 




Virology 1975.63:577-582 




LU-O Oc UJ - o 




J Virol 1976 19-659-667 




^ft4 








HB-l,HB-2, 
HB-3**, 
HB-4, HB-5 & 

xxd-u 




LBacteriol.l979.138:618-624 


Streptococcus 


T12** 




Mol Microbiology. 1997#23:7 19-728 


pyogenes 


A-1 

A-25 
Kjem 


12202-B1 

12204-B1 
14918 
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Streptococcus 
spJEnterococcus 


1 

182 

VD1884 


HER 339 
HER 80 
HER 323 
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1 A 
1Jl> 

IN IN 

4? 

118 
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1 0 1 £Q "R 1 
1 Z 1 / U-£> 1 

oi SQ7-R1 
1QQ48-R1 

1995 1- B2 

19952- B1 


The American Type Culture Collection 


Vpillnnplla. rodpntium 


N2 




Antonie Van Leeuwenhoek 1989 56*263-271 


Vibrio cholerae 


Psi 92 




Intervirology 1993.36:237-244 




VCB-1,2,3&4 




J.Infetion 1998.36:131 




CP-T1** 




I Virol. 1984.5 1:163-169 




VSK 




FEMS Microbiol.Lett.l996.145:17-22 




PW138 




J.Virol. 1986.57:960-967 




Phil49 




J.Virol.l985.140:217-223 
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Fs-2** 




Microbiology 1998.144:1901-1906 




e4 
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B 
















1 1 
1 J 








14 








16 








24 
























11R 


14100-R1 


TVia Amp*rir*nn Tt/hp Onltnrp f^nllppfirvn 

XI1C> xT-lllCIIWall 1 yUC \~/Lll IU.1G V^AJUCW IHJ11 


(Cont'd) 


145 


14100-R2 






149 


1 4 1 0O-FHO 






161 


14100-R4 






"M 4 








<2 5 


51159 R9 






o-ZU 


CI ICO p.'} 






IVif 4 
1VL-4 


CI OCO pyl 






D-10 


C 1 1 CI T»C 

51352-B5 






T 
1 








II 


51352-B7 






TTT 
III 


cnc o no 

51352-B8 






TAT" 

IV 


c i ico Tin 






V 


51352-B10 




Vibrio costicola 


UTAK 
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Vibrio eltor 


e 4 




j.vjen. Virol, lyo/.oo: 141 i-i4io 


rlOrlU nulrlg&Ylii 






.reiix u ncrciic xveierencc 
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• 

V ibrio 


Jfv V r4U 
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parahaemolyticus 


vrjj 
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VP1 








AAA 

<pou 








<t)HAWI-5 








<t>PEL8C-l 






Vibrio sp. 


a3a 
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11985-B1 
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51582-B1 






Phil49 
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N2 




Antonie V.Leeuwenhoek. 1989.56:263-271 
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Yersinia enterocolitica 


1 
2 
3 
4 
< 

6 

1 

8 
9 

(pYeOi-lz 
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I, IV & VIII 




ZentralbLBakteriol.Mikrobiol.Hyg.1982.253: 

1 AO 


Yersinia pestis 


R 
S 
Y 


23208-B1 
11593-B1 
23053-B1 


The American Type Culture Collection 




II 




ZhMikrobiol.Epidemiol.Irnmunobiol. 1 990. 1 1 
:9 


Yersinia 

pseudotuberculosis 


PST** 


23207-B1 


The American Type Culture Collection 


Yersinia sp. 


RD2 
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Table 2 



bacteriophage 77, complete genome sequence, 41708 nucleotides 

1 gatcaaaata cttggggaac ggttagggag taaacttcgc gataatttta aaaattcatg 

61 tataaccccc ctcttataac cattttaagg caggtgatga aatggagatt atagtcgatg 

121 aaaatttagt gcttaaagaa aaagaaaggc tacaagtatt atataaagac atacctagca 

181 ataaattaaa agtagttgat ggtttaatta ttcaagcagc aaggctacgt gtaatgcttg 

241 attacatgtg ggaagacata aaagaaaaag gtgattatga tttatttact caatctgaaa 

301 aggcgccacc atatgaaagg gaaagaccag tagccaaact atttaatgct agagatgctg 

361 catatcaaaa aataatcaaa caattatcgg atttattgcc cgaagagaaa gaagacacag 

421 aaacgccatc tgatgattac ctatgattag taataaatac gttgatgaat atataaattt 

481 gtggaaacaa ggaaagataa ttttaaataa agaaagaatt gatctcttta attatctaca 

541 aaaacatata tattcacgag atgatgtata ttttgatgaa cagaaaatcg aggattgtat 

601 caaatttatt gaaaaatggt attttccaac attaccattt caaaggttta tcatagctaa 

661 tatatttctt atagataaaa atacagatga agctttcttt acagaatttg ctattttcat 

721 gggacgtgga ggcgggaaaa acggtctaat aagtgctatt agtgattttc tttctacgcc 

781 cttacacgga gttaaagaat atcacatctc cattgttgct aatagtgaag atcaagcaaa 

841 aacatcgttt gatgaaatca gaaccgtttt aatggataac aaacgaaata agacgggtaa 

901 aacgccaaaa gctccttatg aagttagtaa agcaaaaata ataaaccgtg caactaaatc 

961 ggttattcga tataacacat caaacacaaa aaccaaagac ggtggacgtg aggggtgtgt 

1021 tatttttgat gaaattcatt atttctttgg tcctgaaatg gtaaacgtca aacgtggtgg 

1081 attaggtaaa aagaaaaata gaagaacgtt ttatataagt actgatggtt ttgttagaga 

1141 gggttatatc gatgcaatga agcacaaaat tgcaagtgta ttaagtggca aggttaaaaa 

1201 tagtagattg tttgcttttt attgtaagtt agacgatcca aaagaagttg atgacagaca 

1261 gacgtgggaa aaggcgaacc caatgttaca taaaccgtta tcagaatacg ctaaaacact 

1321 gctaagcacg attgaagaag aatataacga tttaccattc aaccgttcaa ataagcccga 

1381 attcatgact aagcgaatga atttgcctga agttgacctt gaaaaagtaa tagcaccatg 

1441 gaaagaaata ctagcgacta atagagagat accaaattta gataatcaaa tgtgtattgg 

1501 tggtttagac tttgcaaaca ttcgagattt tgcaagtgta gggctattat tccgaaaaaa 

1561 cgatgattac atttggttag gacattcgtt tgtaagacaa gggtttttgg atgatgtcaa 

1621 attagaacct cctattaaag aatgggaaaa aatgggatta ttgaccattg tcgatgatga 

1681 tgtcattgaa attgaatata tagttgattg gtttttaaag gctagagaaa aatatgggct 

1741 tgaaaaagtc atagctgata attatagaac tgatattgta agacgtgcgt ttgaggatgc 

1801 tggcataaaa cttgaagtac ttagaaatcc aaaagcaata catggattac ttgcaccacg 

1861 tatcgataca atgtttgcga aacataacgt aatatatgga gacaatcctt tgatgcgttg 

1921 gtttactaat aatgttgctg taaaaatcaa gccggatgga aataaagagt atatcaaaaa 

1981 agatgaagtc agacgtaaaa cggatggatt catggctttt gttcacgcat tatatagagc 

2041 agacgatata gtagacaaag acatgtctaa agcgcttgat gcattaatga gtatagattt 

2101 ctaatagagg aggtgagaca tgagtattct agaaaagata tttaaaacta ggaaagatat 

2161 aacatatatg cttgatttag atatgataga agatctatca caacaagcgt atgtgaaacg 

2221 tttagcgatt gatagttgta ttgaatttgt tgcgcgagct gtcgctcaaa gtcattttaa 

2281 agtattggaa ggtaatagaa ttcaaaagaa tgatgtttac tacaagttaa atataaaacc 

2341 aaatactgac ttatcaagcg atagtttttg gcaacaagtt atatataaac taatttatga 

24 01 taacgaggtt ttaatcgtag taagtgacag caaagaatta cttatcgcag atagctttta 

2461 cagagaagag tacgctttgt atgatgatat attcaaagat gtaacggtta aagattatac 

2521 ttatcaacgt actttcacaa tgcaagaggt catatattta aagtacaaca acaataaagt 

2581 gacacacttt gtagaaagtc tattcgaaga ttacgggaaa atattcggaa gaatgatagg 

2641 tgcacaatta aaaaactatc aaataagagg gattttgaaa tctgcctcta gcgcatatga 

2701 cgaaaagaat atagaaaaat tacaagcgtt cacaaataaa ttattcaata cttttaataa 

2761 aaatcaacta gcaatcgcgc ctttgataga aggttttgat tatgaggaat tatctaatgg 

2821 tggtaagaat agtaacatgc ctttttctga attgagtgag ctaatgagag atgcaataaa 

2881 aaatgttgcg ttgatgattg gtatacctcc aggtttgatt tacggagaaa cagctgattt 

2941 ggaaaaaaac acgcttgtat ttgagaagtt ctgtttaaca cctttattaa aaaagattca 

3001 gaacgaatta aacgcgaaac tcataacaca aagcatgtat ttgaaagata caagaataga 

3061 aattgtcggt gtgaataaaa aagacccact tcaatatgct gaagcaattg acaaacttgt 

3121 aagttctggt tcatttacaa ggaatgaggt gcggattatg ttaggtgaag aaccatcaga 

3181 caatcctgaa ttagacgaat acctgattac taaaaactac gaaaaagcta acagtggtga 

3241 aaatgatgaa aaagaaaaag atgaaaacac tttgaaaggt ggtgatgaag atgaaagcgg 

3301 agattaaagg cgtcatcgtt tccaacgaag ataaatgggt ttacgaaatg cttggtatgg 

33 61 attcgacttg tcctaaagat gttttaacac aactagaatt tagtgatgaa gatgttgata 

3421 ttataattaa ctcaaatggt ggtaacctag tagctggtag tgaaatatat acacatttaa 

3481 gagctcataa aggcaaagtg aatgttcgta tcacagcaat agcagcaagt gcggcatcgc 

3 541 ttatcgcaat ggctggtgac cacatcgaaa tgagtccggt tgctagaatg atgattcaca 
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3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 



atccttcaag 
aacatgttgg 
aacttataga 
gttttgcgga 
aagtgttatc 
ttaacattga 
aggaatcaga 
ttttttaata 
aatgcgaaaa 
gaattgtacg 
gaagctgaaa 
aatttcttta 
gaaacaattg 
ggtattaaaa 
gtttggggta 
acagcaattc 
ggtcctgcgt 
cttgaaactg 
gtacaaaaag 
cttacatttg 
tcaactaacg 
ccgtccgatg 
gttactgctt 
gttttaacgt 
aaatttaaag 
tacggcaaag 
aaaccagctt 
gaaatttaaa 
aggggagttg 
aatcaaaaat 
attattagaa 
aatcatcgac 
aatcacttga 
tgtcgtacga 
aagaattgat 
acaattacag 
aagaaagtgt 
tataagtata 
agctgttggg 
ggaacgcaaa 
gaagaacatt 
gtatcaccag 
gtgtgaaagt 
aagagatggt 
aaataaaaaa 
gtactgaacc 
ttgaacgatt 
aatttgtaaa 
agtattttga 
tcatgaagtg 
gttcaataaa 
cgacgaccca 
ccaaatagat 
gatatctaat 
tggaaaaccg 
cattttttat 
tattaacatt 
tagtgatatt 
aaaaacagct 
aatctcatta 
ttatgatgaa 
atggttcaga 
gtttacaaat 
agaggttgaa 
tatctttgat 
tttcttaaag 
aactttgtaa 



tattgcgcaa 
tcaaataatg 
aatgatggct 
tagtaaaatg 
gaaagatgta 
tattgacgca 
aatcgatgtt 
caaaaatagg 
acgaatttat 
gtgacatgat 
gagtttctag 
tggatatcaa 
atagaatctt 
atgctggttt 
aaatctatgg 
aaaataaatt 
ggattgaaag 
cgttcttaaa 
gtgtatcggt 
ctaatccgcg 
agaaaggtaa 
cttttgaggt 
taccatttaa 
acgttaaagg 
aaacacttgc 
cgaaagataa 
tagaagatac 
gttgttagag 
tatccagctg 
aagtacgaca 
ctatgcgaat 
ttattgaatg 
aaagattgac 
gcgtataaaa 
acttatacgc 
acctgaaata 
ttaagaaacc 
ctgaaaataa 
cgagtattga 
atgacattaa 
atcttgaaat 
atttggataa 
gacaggtgat 
aaaagttcaa 
acaactcaaa 
tgaatggata 
tagaatagta 
acctaaagct 
gacgctaaaa 
attagtcaag 
taccctaatg 
atacctacaa 
gtttttgtta 
cgcattcaaa 
gaatatatag 
aaggaggaaa 
actggtttag 
acaaaaacaa 
tatgctgatg 
caaatgcatg 
gatggcgttt 
caagagcgta 
cctaaaatcg 
ggtgaggcac 
tcagctaaca 
aaaattttag 
caaaaccggc 



ggagaagtga 
gctgaggcat 
aaggaaacgt 
tttgaaaacg 
ttaaatcgtg 
atagcaaata 
gcagatagta 
aggtcataaa 
taatgcagta 
taaccaacta 
tttacctaaa 
taagagtgtt 
cgaagattta 
gcgtttgaag 
tgaaattaaa 
gacagcgttt 
atttgttcgt 
aggtactggt 
aactgatggt 
cgctacggtt 
atcagtagcg 
tcaagcacag 
tttgaatgtt 
tctatatgat 
gttagatgat 
taaagttgct 
cgaagaaaca 
aatttaaaga 
aagggtataa 
aagtttatat 
cattacaaaa 
gtgaagacaa 
cataattcag 
aatcagtgcg 
gctagatatg 
atagattttt 
tagaattaca 
tggtccagaa 
tggtgtctgg 
attgtatatt 
tgaatcaaga 
taaagacttt 
aaagcattag 
gataaggcgt 
ccttcagaag 
aaggggaaac 
catttaattg 
atgggtggga 
agggagttga 
acagaattat 
taaaagatac 
cttatactga 
agtacaatga 
agttattatg 
aagaatttaa 
attaaatggc 
gtttcgctaa 
gaggattaca 
gcggtccaat 
cgttccctaa 
acgaagagaa 
aagacggtac 
atggagaaac 
ttttcccttt 
tgacaaatca 
gcgaagaata 
ttcatcggaa 



aagatctaaa 
atgcggttag 
ggctaaatgc 
acaatatgca 
taacagcttt 
aagtaattga 
aattatcagc 
atgactataa 
aacaacggtg 
tttgaagaaa 
tcagcacaaa 
ggatataaag 
acaacgaatc 
ttcttaaaat 
ggtcaattag 
gttgttttac 
gttcaaatcg 
aaagaccaac 
gcttatccag 
aatgaattga 
gttaaaggta 
tatacacatt 
attgagtcta 
ggttatttag 
atggatttat 
gctgtttgga 
ctataaaatt 
catagagcac 
caatcctcgt 
cgtaccttta 
aaaagcgtct 
tgacgattga 
aggatgagta 
gagtttttga 
cttatcaaga 
cgttatctct 
actaaacgtt 
gctggagaaa 
ttacgtgaat 
cgtgatccgc 
tatttcaaaa 
attatgattc 
aaagagaatt 
taatagctgg 
actcaggagc 
gtactgttac 
aaaatggtca 
ttaatagagc 
aaaaattgtg 
tagagagcac 
tgatgtacct 
cggagatgag 
tgaatataat 
gtctgaacta 
aacatataga 
agtaaaacat 
attaacgaaa 
aaaaattggt 
tgaatcaggg 
agagattcgc 
acaaggtaaa 
atttagaaca 
ggctgagaaa 
agttgataat 
tgatggagac 
tactggaaac 
actgcggtaa 



tcatgctgca 
agctggtaaa 
tgatgaagcc 
aattgtagca 
ggtaagtaaa 
aaaaataaat 
aaatggattt 
atttatcgga 
aaccgcaaga 
ctaaattaca 
ctttgagtgc 
aagaaaaact 
atccattatt 
ccgaaacttc 
atgctgcgtt 
caaaagattt 
aagaagcatt 
cgattggctt 
agaaagaaga 
cgcaagtgtt 
atgtaacaat 
taaatgcaaa 
cagttcaaga 
ctggtggtat 
acactgcaaa 
aattagattt 
ttatgaggtg 
aatcaacaca 
gttgaattgt 
gataagctga 
agttcaatgg 
tgatttgctt 
cttaaagcag 
attagagaat 
tttattagaa 
aatggaggta 
taaatacgcg 
aagaagaaaa 
tagaacaagc 
aaggtgatta 
atcgtttgaa 
gcggaggata 
agaaaaacat 
tgctaaggta 
actgattagt 
aattaggtgg 
tgttgagaaa 
aataagacaa 
attgatattt 
gtaaatatca 
tttattgtta 
tgtgcatata 
gcgagaatca 
aaaatgggaa 
agctctcgcg 
gcaagtgcgc 
gaaggcgcgg 
gttgaaactg 
aatacagacg 
aaaattgttt 
caaaacaatt 
gttttattac 
gattgggatt 
aaaaagtcag 
ggtgaaaaag 
gtgacagagg 
agtcggttaa 



gaaacattag 
aacaaacaag 
attgaacaag 
agcgatacac 
acgccagagg 
atgaaagaaa 
tcaagattcc 
aacattcgca 
aagacaaaat 
agcaaaagca 
aaaccaaaga 
tttaccagaa 
agctgactta 
tggcgtggct 
cagtgaagaa 
aaatgatttt 
tgcagtggcg 
aaaccgtcaa 
acaaggtacg 
taaataccac 
ggttgttaat 
tggcgtatat 
agcaggtaag 
taatgttcag 
acaatttgct 
aaaaggacat 
ataaaatggt 
agtacaaagt 
tgacaaatca 
caaaacaaga 
ttaaaagtga 
gtcaaattta 
ttgttaaaaa 
ttaataggtc 
cacttcaacg 
tcagaagatg 
tgttcatttt 
attattatat 
tatctcaaac 
tttacccagt 
tataaagcaa 
tagttcatga 
tttggcataa 
attgttgaag 
gagattggtc 
cgtgggcctt 
aagtcaggaa 
gggcaaaata 
tgtacaaagt 
ataatattaa 
ttgacgatat 
gttatattgt 
taagaaataa 
atgtttcaaa 
tttacgaggg 
caaaggcgta 
aattaaaata 
gtggagaact 
gagaaggtaa 
ttaatgaaga 
acgtagctgt 
ctaaagttat 
tctcaagtga 
tacgtaagta 
gcgaagaggc 
gtaacgaaga 
tataccagat 
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7621 agcattaaaa cacttaaagt tggcgacaca 

7681 aatcaaagta agttattgaa atacacaaca 

7741 gatggtcaag ttactgcgga agcacaaggc 

7801 atgagtgaca ctataacaat aaatgtagaa 

7861 ttgaaaataa ggagagtatt ataaaatggc 

7921 agaagatcca aaagcaaatg aaattaaatt 

7981 atttgaaatt gtatacgaag caatggattt 

8041 gatgaagcca agagaaatcg ctgacagatt 

8101 ccaattcaca gttaaagacc taaaagaacg 

8161 tcgtgaacaa gtgattttca ttactcaagg 

8221 ccagaacatg aaataaagcc tgaagattta 

8281 actctcatga tggacttaat tgaaaatggt 

8341 tttcattatg tgctttccat atatcaaaat 

8401 gaggctttaa ttgatgcatt ttaaccttaa 

8461 ttttagaaag gaggtaaaaa atgggagaaa 

8521 tagatgcagc aaatttaaat agatcatttg 

8581 attctgactt aaaattaaca ggcaacaact 

8641 acaaacaaag gattaaagaa cttgatggaa 

8701 atttagccaa gcaatatgac aaggtatctc 

8761 aaaagttacg acaagaatat aacaaacaag 

8821 tacaaaaaac atcagccgaa tttgaagagt 

8881 tggcagaaag tggctgggga aaaaccagta 

8941 caaaaatggg tgatggttta aaatccattg 

9001 ctgttttagg tattgcagca gcatcaggaa 

9061 atactgttac tcaagcaaca ggcgcaacag 

9121 ttaaagatgt ttatggcaat tttccagcag 

9181 aagttaatac aaggttaggt tttacaggta 

9241 tgaaattcag tcatataaca ggttctgacg 

9301 caatgggcga tgcaggtatc gaagcaagtg 

9361 aagcggcgca agctagtggg ataagtgttg 

9421 gcgctccaat gagagctatg ggctttgaga 

94 81 gggaaaagtc aggcgttaat actgaaatag 

9541 attggggtaa agctggtaaa aacccaagag 

9601 aaaagacgcc ggatatagct agcgcaacaa 

9661 caggtcctga tttagcagac gctattaaag 

9721 aaactattga agattcccaa ggcacagtaa 

9781 ccgaaagatt taaagtagca atgaataaat 

9841 ctattgaaag tgcgtttgct cccgtaatgg 

9901 ttgattggtt ttccaattta agtgatggtt 

9961 ttgctgctgc aattggtcct gtagtttttg 

10021 atgcagtaac tgtattagct ccattgttag 

10081 gttttttatc gactaaagta cctatattag 

10141 ttggcattgt attaggtgta ttggctggtt 

10201 aatctgaaac atttagaaat tttgttaatg 

10261 gtaattttat tcaatttatt caacctttcg 

10321 cgatatcagc aatagttgat ttcgcaaaag 

10381 atgaaaacgg aatttccatt gttcaagcac 

10441 tatttgaatt tattttaaat tttgtaatta 

10501 tgcaatttat ttggccggcg gttaaagcct 

10561 gtgtaataca aggtgcttta aatatcatac 

10621 tcgttggtga ttggcgagga gtttgggacg 

10681 aattaatttg gaatttagtt caattatggt 

10741 actttggcgg gttgctaaaa ggattgatag 

10801 tcagtaaatc tttatcagca atttggaatg 

10861 atagcgtaaa atcaattttc acaaatatga 

10921 tccgtacgaa tacaatagga aaagcgcagt 

10981 ctaatttatg gaatgcgacg aaagaaattt 

11041 tttggaattc cattaaagat aatacggtag 

11101 gtggaatttt cacaaatatg cgcgatggct 

11161 atatcggcgg tatggtaagc gctattaaaa 

11221 actgggtcgg tggtaagttg ggaatggata 

11281 acacacatac tactacaaga ttagttaaga 

11341 cagttgggga taagggacgc ggaaatggtc 

11401 tccctaacgg taaacgtgta atcacaccta 

11461 gctcaaaagt atacaacggt gcacaaactt 

11521 ttagtttagg tactatgtgg aaagatatta 

11581 caaaagataa aataggtaaa ggtaccaaat 
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tacgatttaa atgttgtagt agagccatct 
gatcaaacga atattgtatc aatcaatagt 
attgctacgg ttaaagcaac agttggtaat 
gcataagagg gggcaacccc tctattttat 
aaaattaaaa cgtaacatta ttcaattagt 
acaaacgtac ttaacaccac acttcatttc 
aatcgatgat attgaggacg aaaatagcac 
gatggatatg gttgtaaaaa tttacgataa 
tatgcatgca cctgatggaa tgaatgcact 
tcaacaaact gaggaaacta gaaattttat 
acatataaag caatgttgaa aaatatggat 
aaagacgcta acgaagtttt aaaaatgcca 
aaaaataatg acatttctga agaaaaagca 
ccgtttggtt agggttattt ttttgaactt 
gaataaaagg tttatctata ggtttggatt 
cagaaatcaa acgaaacttt aaaactttaa 
tcaaatatac cgaaaaatca actgatagtt 
ctatcacagg ttataagaaa aacgttgatg 
aagaacaggg cgaaaacagt gcagaagctc 
caaatgagct gaattattta gaaagagaat 
tcaaaaaagc tcaagttgaa gctcaaagaa 
aagtttttga aagtatggga cctaaattaa 
gtaaaggttt gatgattggt gtaactgcac 
aagcttttgc agaagttgat aaaggtttag 
gcagtgaatt aaaaaaattg cagaactcat 
atgctgaaac tgttggtgga gttttaggag 
aagaacttga aaatgccaca gagtcattct 
gtgtgcaagc cgtacagtta attacccgtg 
aatatcaaag tgttttggat atggtagcaa 
atacattagc tgatagtatt actaaatacg 
tgaaagaatc aattgcttta ttctctcaat 
cattcagtgg tttgaaaaaa gctatatcaa 
aagaatttaa gaagacatta gcagaaattg 
gtttagcgat tgaagcattt ggtgcaaagg 
gtggtcgctt tagttatcaa gaatttttaa 
accaaacatt taaagattct gaaagtggct 
taaaattagt aggtgctgat gtatgggctt 
aagaattaat caaaaagcta tctatagcgg 
ctaaaagatc aattgttatt ttcagtggta 
ggttaggtgc atttataagt acaattggca 
ctagtattgc aaaggctggt ggattgatta 
gaactgtctt cacagcttta actggtccaa 
tagcagtcgc atttacaatt gcttataaga 
gtgcaattga aagtgttaaa caaacattta 
ttgattctgt taaaaacatc tttaaacaag 
atatttggag tcaaatcaat ggattcttta 
ttcaaaatat atgcaacttt attaaagcga 
aaccaattat gttcgcgatt tggcaagtga 
tgattgtcag tacttgggag aacataaaag 
ttggcttgat taagttcttc tcaagtttat 
ccgttgtgat gattcttaaa ggagcagttc 
ttgtaggtaa aatacttggt gttgttaggt 
caggaatttg ggacgtaata agaagtatat 
caacaaaaag tatttttgga tttttattta 
aaaattggtt atctaatact tggagcagta 
cattatttag tggcgtcaaa tcaaaattta 
ttagtaattt aagaaattgg atgtcaaata 
gaattgcaag ccgtttatgg agtaaggtac 
tgagttccat tatagataag attaaaagtc 
aaggacttaa taaattaatc gacggtttaa 
aaatacctaa gttacacact ggtacagagc 
acggtaagat tgcacgtgac acattcgcta 
caaatggttt tagaaatgaa atgattgaat 
atacagatac taccgcttat ttacctaaag 
attcaatgtt aaacggaacg cttccaagat 
aatctggtgc atcatcggca tttaactgga 
ggcttggcga taaagttggc gatgttttag 
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11641 attttatgga aaatccaggc aaacttttaa attatatact tgaagctttt ggaattgatt 

11701 tcaattcttt aactaaaggt atgggaattg caggcgacat aacaaaagct gcatggtcta 

11761 agattaagaa aagtgctact gattggataa aagaaaattt agaagctatg ggcggtggcg 

11821 atttagtcgg cggaatatta gaccctgaca aaattaatta tcattatgga cgtaccgcag 

11881 cttataccgc tgcaactgga agaccatttc atgaaggtgt cgattttcca tttgtatatc 

11941 aagaagttag aacgccgatg ggtggcagac ttacaagaat gccatttatg tctggtggtt 

12001 atggtaatta tgtaaaaatt actagtggcg ttatcgatat gctatttgcg catttgaaaa 

12061 actttagcaa atcaccacct agtggcacga tggtaaagcc cggtgatgtt gttggtttaa 

12121 ctggtaatac cggatttagt acaggaccac atttacattt tgaaatgagg agaaatggac 

12181 gacattttga ccctgaacca tatttaagga atgctaagaa aaaaggaaga ttatcaatag 

12241 gtggtggcgg tgctacttct ggaagtggcg caacttatgc cagtcgagta atccgacaag 

12301 cgcaaagtat tttaggtggt cgttataaag gtaaatggat tcatgaccaa atgatgcgcg 

12361 ttgcaaaacg tgaaagtaac taccagtcaa atgcagtgaa taactgggat ataaatgctc 

12421 aaagaggaga cccatcaaga ggattattcc aaatcatcgg ctcaactttt agagcaaacg 

12481 ctaaacgtgg atatactaac tttaataatc cagtacatca aggtatctca gcaatgcagt 

12541 acattgttag acgatatggt tggggtggtt ttaaacgtgc tggtgattac gcatatgcta 

12601 caggtggaaa agtttttgat ggttggtata acttaggtga agacggtcat ccagaatgga 

12661 ttattccaac agatccagct cgtagaaatg atgcaatgaa gattttgcat tatgcagcag 

12721 cagaagtaag agggaaaaaa gcgagtaaaa ataagcgtcc tagccaatta tcagacttaa 

12781 acgggtttga tgatcctagc ttattattga aaatgattga acaacagcaa caacaaatag 

12841 ctttattact gaaaatagca caatctaacg atgtgattgc agataaagat tatcagccga 

12901 ttattgacga atacgctttt gataaaaagg tgaacgcgtc tatagaaaag cgagaaaggc 

12961 aagaatcaac aaaagtaaag tttagaaaag gaggaattgc tattcaatga tagacactat 

13021 taaagtgaac aacaaaacaa ttccttggtt gtatgtcgaa agagggtttg aaataccctc 

13081 ttttaattat gttttaaaaa cagaaaatgt agatggacgt tcggggtcta tatataaagg 

13141 gcgtaggctt gaatcttata gttttgatat acctttggtg gtacgtaatg actatttatc 

13201 tcacaacggc attaaaacac atgatgacgt cttgaatgaa ttagtaaagt tttttaacta 

13261 cgaggaacaa gttaaattac aattcaaatc taaagattgg tactggaacg cttatttcga 

13321 aggaccaata aagctgcaca aagaatttac aatacctgtt aagttcacta tcaaagtagt 

13381 actaacagac ccttacaaat attcagtaac aggaaataaa aatactgcga tttcagacca 

13441 agtttcagtt gtaaatagtg ggactgctga cactccttta attgttgaag cccgagcaat 

13501 taaaccatct agttacttta tgattactaa aaatgatgaa gattatttta tggttggtga 

13561 tgatgaggta accaaagaag ttaaggatta catgcctcct gtttatcata gtgagtttcg 

13621 tgatttcaaa ggttggacta agatgattac tgaagatatt ccaagtaatg acttaggtgg 

13681 taaggtcggc ggtgactttg tgatatccaa tcttggcgaa ggatataaag caactaattt 

13741 tcctgatgca aaaggttggg ttggtgctgg cacgaaacga gggctcccta aagcgatgac 

13801 agattttcaa attacctata aatgtattgt tgaacaaaaa ggtaaaggtg ccggaagaac 

13861 agcacaacat atttatgata gtgatggtaa gttacttgct tctattggtt atgaaaataa 

13921 atatcatgat agaaaaatag gacatattgt tgttacgttg tataaccaaa aaggagaccc 

13981 caaaaagata tacgactatc agaataaacc gataatgtat aacttggaca gaatcgttgt 

14041 ttatatgcgg ctcagaagag taggtaataa attttctatt aaaacttgga aatttgatca 

14101 cattaaagac ccagatagac gtaaacctat tgatatggat gagaaagagt ggatagatgg 

14161 cggtaagttt tatcagcgtc cagcttctat catagctgtc tatagtgcga agtataacgg 

14221 ttataagtgg atggagatga atgggttagg ttcattcaat acggagattc taccgaaacc 

14281 gaaaggcgca agggatgtca ttatacaaaa aggtgattta gtaaaaatag atatgcaagc 

14341 aaaaagtgtt gtcatcaatg aggaaccaat gttgagcgag aaatcgtttg gaagtaatta 

14401 tttcaatgtt gattctgggt acagtgaatt aatcatacaa cctgaaaacg tctttgatac 

14461 gacggttaaa tggcaagata gatatttata gaaaggagat gagagtgtga tacatgtttt 

14521 agattttaac gacaagatta tagatttcct ttctactgat gacccttcct tagttagagc 

14581 gattcataaa cgtaatgtta atgacaattc agaaatgctt gaactgctca tatcatcaga 

14641 aagagctgaa aagttccgtg aacgacatcg tgttattata agggattcaa acaaacaatg 

14701 gcgtgaattt attattaact gggttcaaga tacgatggac ggctacacag agatagaatg 

14761 tatagcgtct tatcttgctg atataacaac agctaaaccg tatgcaccag gcaaatttga 

14821 gaaaaagaca acttcagaag cattgaaaga tgtgttgagc gatacaggtt gggaagtttc 

14881 tgaacaaacc gaatacgatg gcttacgtac tacgtcatgg acttcttatc aaactagata 

14941 tgaagtttta aagcaattat gtacaaccta taaaatggtt ttagattttt atattgagct 

15001 tagctctaat accgtcaaag gtagatatgt agtactcaaa aagaaaaaca gcttattcaa 

15061 aggtaaagaa attgaatatg gtaaagattt agtcgggtta actaggaaga ttgatatgtc 

15121 agaaatcaaa acagcattaa ttgctgtggg acctgaaaat gacaaaggga agcgtttaga 

15181 gctagttgtg acagatgacg aagcgcaaag tcaattcaac ctacctatgc gctatatttg 

15241 ggggatatat gaaccacaat cagatgatca aaatatgaat gaaacacgat taagttcttt 

15301 agccaaaaca gagttaaata aacgtaagtc ggcagttatg tcatatgaga ttacttctac 

15361 tgatttggaa gttacgtatc cgcacgagat tatatcaatt ggcgatacag tcagagtaaa 

15421 acatagagat tttaacccgc cattgtatgt agaggcagaa gttattgctg aagaatataa 

154 81 cataatttca gaaaatagca catatacatt cggtcaacct aaagagttca aagaatcaga 

15541 attacgagaa gagtttaaca agcgattgaa cataatacat caaaagttaa acgataatat 

15601 tagcaatatc aacactatag ttaaagatgt tgtagatggt gaattagaat actttgaacg 
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15661 caaaatacac aaaagtgata caccgccaga aaatccagtc aatgatatgc tttggtatga 

15721 tacaagtaac cctgatgttg ctgtcttgcg tagatattgg aatggtcgat ggattgaagc 

15781 aacaccaaat gatgttgaaa aattaggtgg tataacaaga gagaaagcgc tattcagtga 

15841 attaaacaat atttttatta atttatctat acaacacgct agtcttttgt cagaagctac 

15901 agaattactg aatagcgagt acttagtaga taatgatttg aaagcggact tacaagcaag 

15961 tttagacgct gtgattgatg tttataatca aattaaaaat aatttagaat ctatgacacc 

16021 cgaaactgca acgattggtc ggttggtaga tacacaagct ttatttcttg agtatagaaa 

16081 gaaattacaa gatgtttata cagatgtaga agatgtcaaa atcgccattt cagatagatt 

16141 taaattatta cagtcacaat acactgatga aaaatataaa gaagcgttgg aaataatagc 

16201 aacaaaattt ggtttaacgg tgaatgaaga tttgcagtta gtcggagaac ctaatgttgt 

16261 taaatcagct attgaagcag ctagagaatc cacaaaagaa caattacgtg actatgtaaa 

16321 aacatcggac tataaaacag acaaagacgg tattgttgaa cgtttagata ctgctgaagc 

16381 tgagagaacg actttaaaag gtgaaatcaa agataaagtt acgttaaacg aatatcgaaa 

16441 cggattggaa gaacaaaaac aatatactga tgaccagtta agtgatttgt ccaataatcc 

16501 tgagattaaa gcaagtattg aacaagcaaa tcaagaagcg caagaagctt taaaatcata 

16561 cattgatgct caagatgatc ttaaagagaa ggaatcgcaa gcgtatgctg atggtaaaat 

16621 ttcggaagaa gagcaacgcg ctatacaaga tgctcaagct aaacttgaag aggcaaaaca 

16681 aaacgcagaa ctaaaggcta gaaacgctga aaagaaagct aatgcttata cagacaacaa 

16741 ggtcaaagaa agcacagatg cacagaggaa aacattgact cgctatggtt ctcaaattat 

16801 acaaaatggt aaggaaatca aattaagaac tactaaagaa gagtttaatg caaccaatcg 

16861 tacactttca aatatattaa acgagattgt tcaaaatgtt acagatggaa caacaatcag 

16921 atatgatgat aacggagtgg ctcaagcttt gaatgtgggg ccacgtggta ttagattaaa 

16981 tgctgataaa attgatatta acggtaatag agaaataaac cttcttatcc aaaatatgcg 

17041 agataaagta gataaaaccg atattgtcaa cagtcttaat ttatcaagag agggtcttga 

17101 tatcaatgtt aatagaattg gaattaaagg cggtgacaat aacagatatg ttcaaataca 

17161 gaatgattct attgaactag gtggtattgt gcaacgtact tggagaggga aacgttcaac 

17221 agacgatatt tttacgcgac tgaaagacgg tcacctaaga tttagaaata acaccgctgg 

17281 cggttcactt tatatgtcac attttggtat ttcgacttat attgatggtg aaggtgaaga 

17341 cggtggttca tctggtacga ttcaatggtg ggataaaact tacagtgata gtggcatgaa 

17401 tggtataaca atcaattcct atggtggtgt cgttgcacta acgtcagata ataatcgggt 

17461 tgttctggag tcttacgctt catcgaatat caaaagcaaa caggcaccgg tgtatttata 

17521 tccaaacaca gacaaagtgc ctggattaaa ccgatttgca ttcacgctgt ctaatgcaga 

17581 taatgcttat tcgagtgacg gttatattat gtttggttct gatgagaact atgattacgg 

17641 tgcgggtatc aggttttcta aagaaagaaa taaaggtctt gttcaaattg ttaatggacg 

17701 atatgcaaca ggtggagata caacaatcga agcagggtat ggcaaattta atatgctgaa 

17761 acgacgtgat ggtaataggt atattcatat acagagtaca gacctactgt ctgtaggttc 

17821 agatgatgca ggagatagga tagcttctaa ctcaatttat agacgtactt attcggccgc 

178 81 agctaatttg catattactt ctgctggcac aattgggcgt tcgacatcag cgcgtaaata 

17941 caagttatct atcgaaaatc aatataacga tagagatgaa caactggaac attcaaaagc 

18001 tattcttaac ttacctatta gaacgtggtt tgataaagct gagtctgaaa ttttagctag 

18061 agagctgaga gaagatagaa aattatcgga agacacctat aaacttgata gatacgtagg 

18121 tttgattgct gaagaggtgg agaatttagg attaaaagag tttgtcacgt atgatgacaa 

18181 aggagaaatt gaaggtatag cgtatgatcg tctatggatt catcttatcc ctgttatcaa 

18241 agaacaacaa ctaagaatca agaaattgga ggagtcaaag aatgcaggat aacaaacaag 

18301 gattacaagc taatcctgaa tatacaattc attatttatc acaggaaatt atgaggttaa 

18361 cacaagaaaa cgcgatgtta aaagcgtata tacaagaaaa taaagaaaat caacaatgtg 

18421 ctgaggaaga gtaatcctta gcactatttt tatacaaaaa tttaaggagg tcatttaatt 

18481 atggcaaaag aaattatcaa caatacagaa aggtttattt tagtacaaat cgacaaagaa 

18541 ggtacagaac gtgtagtata tcaagatttc acaggaagtt ttacaacttc tgaaatggtt 

18601 aaccatgctc aagattttaa atctgaagaa aacgctaaga aaattgcgga gacgttaaat 

18661 ttgttatatc aattaactaa caaaaaacaa cgtgtgaaag tagttaaaga agtagttgaa 

18721 agatcagatt tatctccaga ggtaacagtt aacactgaaa cagtatgaaa agctatgagt 

18781 tagatactca tagtctttat tcttttagaa agcgggtgta ctgaattggg gtggttcaaa 

18841 aaacacgaac atgaatggcg catcagaagg ttagaagaga atgataaaac aatgctcagc 

18901 acactcaacg aaattaaatt aggtcaaaaa acccaagagc aagttaacat taaattagat 

18961 aaaaccttag atgctattca aaaagaaaga gaaatagatg aaaagaataa gaaagaaaat 

19021 gataagaaca tacgtgatat gaaaatgtgg gtgcttggtt tagttgggac aatatttggg 

19081 tcgctaatta tagcattatt gcgtatgctt atgggcatat aagagaggtg attaccatgt 

19141 tcggattaaa ttttggagct tcgctgtgga cgtgtttctg gtttggtaag tgtaagtaat 

19201 agttaagagt cagtgcttcg gcactggctt tttattttgg ataaaaggag caaacaaatg 

19261 gatgcaaaag taataacaag atacatcgta ttgatcttag cattagtaaa tcaattctta 

19321 gcgaacaaag gtattagccc aattccagta gacgatgaaa ctatatcatc aataatactt 

19381 actgtagtcg ctttatatac aacgtataaa gacaatccaa catctcaaga aggtaaatgg 

19441 gcaaatcaaa aattaaagaa atataaagct gaaaataagt atagaaaagc aacagggcaa 

19501 gcgccaatta aagaagtaat gacacctacg aatatgaacg acacaaatga tttagggtag 

19561 gtggttgata tatgttaatg acaaaaaatc aagcagaaaa atggtttgac aattcattag 

19621 ggaaacaatt caacccagat ggttggtatg gatttcagtg ttatgattac gccaatatgt 
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19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 
23521 
23581 
23641 



tctttatgtt 
ataataaagc 
cgcaaaagtt 
aaattgttga 
aaggttggac 
ttcattatta 
ttggcaataa 
aacctaaaaa 
acggaacaaa 
taagacatgc 
atcaagatac 
ttaaatcaca 
caagtggtgg 
tacaagatgt 
tactaaatgt 
ttattactaa 
taatagccgg 
catcagctaa 
atgtccctta 
taagagacgg 
ttacgtatga 
gtggacaacg 
gttttggtaa 
tatagggaat 
tttttaacat 
tattttttta 
catcaactat 
gatagagagc 
taacagttta 
tagccgggca 
cactcgatac 
tcgatacggt 
attgtggtta 
caaacgcttg 
tttggacgct 
cttgtgttaa 
aaaaaagggc 
taactctctg 
tgtatgtcct 
tatgtgtgta 
agctgaggac 
gaatataaac 
cattattttt 
tgcggtctta 
gccattaata 
ctcacctatg 
agctctatac 
catctctaaa 
cttctttggt 
ggcgtattta 
tacgtttgat 
taaattttga 
cgttacttta 
aaaagtcaaa 
ttctaaacga 
tacacgtttc 
ttcattgttc 
aaaaaataat 
aaaatacaga 
aatatatacg 
agggatctgc 
tattactgga 
ctggaaagaa 
tgatactatg 
atcagatata 
ttttttatcg 
aaacaatctt 



agcgacaggc 
aaagattgaa 
ggatattgtc 
gagcgcaaat 
taatggcgtt 
tgacaatcca 
agctaaaggt 
aattatgctt 
cgaacgcgat 
aggacatgaa 
tgcatacggt 
ggggtatgac 
gcatgttatt 
tattaaaaat 
taatgtatca 
taaaaatgat 
tgcgattcat 
aaacaaaaaa 
taaaaaagaa 
ttattcaact 
cggtgcatat 
tcgttatata 
gtttagcacg 
cttacagtta 
ttctctcaag 
tgttatagct 
ttacatctat 
atagttttca 
cggggtgctt 
gaggccatgt 
atatatctta 
tatatttatt 
ttttttgcgt 
tggaaaagct 
cgtgtacgtt 
aaagccttta 
agaaaaaggg 
tccattttct 
actcttttca 
tgccttagtg 
aatcgtttgt 
cctctatcaa 
ttcaatacat 
gtagtatctt 
gcgatcgttt 
cgcatacctg 
tgcatgttat 
tagttataca 
agtgtgacgc 
atagcttctt 
aatttgttaa 
gaactgttct 
aagccagatg 
gtttttaatt 
aacattgcct 
catttatctg 
ttatttttaa 
aagggtaggc 
cgccacttat 
tgttttaaag 
aatatattat 
tttttaattt 
tttatgcaag 
ttattaatgt 
aattcaataa 
aaaacttctt 
aaataatact 



gaaaggctgc 
aaatatggtc 
gttttcccgt 
ttaaatactt 
gcgcaacctg 
atgtatttta 
attattaagc 
gtagccggtc 
tttatacgta 
gttgcattat 
gttaatgtag 
attgttctag 
atctcaagtc 
aacttaggac 
gcagaaataa 
atggattgga 
ggtaagccta 
aatccaccag 
caaggcaatt 
aattcaagaa 
tgtattaatg 
gcgacaggag 
atttagtatt 
ttaaataact 
atttaaatgt 
agccttcggg 
ccttgttcac 
tactactccc 
ttatgttata 
atctgactgt 
acaacataga 
cccctacaac 
ttttttgggg 
aaaaggttaa 
agagaatgac 
atatcagttg 
cagatacctt 
ctgttacatg 
taattgcttt 
tgtgagtagt 
ttatcctact 
catagcttgg 
ttgctatcct 
tgtgaccaaa 
tatttttgag 
ttaaagcttg 
tatcgttcag 
ttttcgcttc 
tatttaatat 
tcatatgtcc 
taaatgtttg 
ttttgatgtt 
tttttatatg 
cgcttgacga 
ctttttgcga 
tatacggatc 
atttttcaaa 
gggctaccca 
aattataaga 
gataaacctt 
tattaattct 
tttggggtaa 
cgtaactatt 
ttctgtcaat 
aataatcttt 
ttaatatagc 
cccatttcaa 



aaggtttata 
aaataattaa 
caaagtatgg 
tcacatcatt 
gttggggtcc 
ttaggttaaa 
aagcgactac 
atggttataa 
aatatataac 
acggtggctc 
gcaataaaaa 
aaatacattt 
aattcaatgc 
aaataagagg 
atataaatta 
ttaagaaaaa 
taggtggttt 
tgccagcagg 
acacagtagc 
ttacaggggt 
gttatagatg 
aggtagacaa 
tacttagaat 
atttggatgg 
agataacagg 
ctagtttttt 
ccaagcatgt 
cgtagtatat 
attgctttta 
tggtcccaca 
aatgttacat 
caacaaaacc 
caaaaaaagg 
aaatgacaaa 
cggtttacca 
ttacaaagga 
ttagtacaca 
tgtatacacc 
taacgatata 
aactttttta 
gccttgcata 
ttcccattgt 
tgaattgatg 
tccagcatta 
gtcaacatct 
aacttctaca 
tataaaatcg 
ttctttttct 
gtgttcgttt 
aagttgacgc 
catgtacttt 
tttgattctt 
atattcaagc 
cttgttgttt 
ttgctttgta 
tttgtatttc 
ccacatttta 
tgaaaattgt 
ttacatggtt 
taatatatta 
atttatcagt 
aacttttctt 
accttttaat 
tttatttaat 
agtgatgaat 
tgaattattt 
atcaaaattc 



tgcttataat 
aaactatgac 
tggcggagct 
tggtcaaaac 
tgaaactgtg 
cttccctaac 
aaaaaaagag 
cgatcctgga 
gcctaatatc 
aagtcaatca 
agattatggc 
agacgcagca 
agatactatt 
tgtgacacct 
tcgtttatct 
ctatgacttg 
ggtagctggt 
ttatacactc 
taatgttaaa 
attacccaac 
gattacttat 
ggcaggtaat 
aaaaattttg 
atgttaatat 
caggtacttc 
gttatgatgt 
cactggatgt 
atgactttag 
tatagtagga 
ggagacatct 
tcgctataac 
acagatccta 
gcagattatt 
aaccttgata 
tcatacaagg 
tttgtagcgt 
agtttttcta 
tttatagtcg 
ttcatttccg 
tttatattta 
ggatttcctt 
tgcatctttt 
gcgatttttc 
catttgattc 
ttaacttgga 
gccccagcaa 
cgtatctgta 
atatcttcta 
ggataattgt 
tttacctgat 
gtatcaattt 
gttttcaaat 
cattcatcta 
agtttttctt 
ttcttattca 
tcgtagtatc 
catccctcct 
ataaaaaaag 
aattaccaaa 
aaattatatc 
aacataatat 
atgcgaaact 
ttttttacct 
ttattttcaa 
tctgtgttgt 
tgcgcgctaa 
atctttaaat 



atcccgtttg 
agctttttac 
ggacacgttg 
tggaacggta 
acaagacatg 
aacttaagcg 
gcagtaatta 
gcagtaggaa 
gctaagtatt 
caagatatgt 
ttatattggg 
ggagaaagcg 
gataaaagta 
cgtaatgatt 
gaattaggtt 
tattctaaat 
aatgttaaaa 
gataagaata 
ggtaataatg 
aacacaacaa 
attgctaata 
agaataagta 
ctacattaat 
tcctatacac 
ggtacttgcc 
gttacacatg 
tttttcttgc 
cattcccgta 
gtgaactata 
tccttgtcat 
cgtatcttaa 
ttaatttagg 
tgaaaaaggg 
caacagtgtt 
gtgggattaa 
ctttaaaaat 
atttttgctc 
ttttttcatc 
ccaataaact 
atgattctgc 
ggcaagttgt 
tattttctaa 
ttcttgaacc 
tgtgaatagt 
gagctaataa 
ctaaaatacg 
ttacctgttc 
tcgtcttact 
aaaatttaac 
ttgcagaata 
tgtttaaaag 
tatcaagcgt 
ataacgcgtg 
ttattttttc 
agacaacact 
tatacttcgt 
caaaattggc 
acgcctgtat 
aatggtaacg 
atcttatatc 
ccgaagaatc 
tactaatcgg 
tatcaattgc 
tttctaaact 
ttttttggta 
ttaaatttaa 
actttttgtt 
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23 701 ttctttagag gataagggaa taacatttac 

23 761 catcactatt gcaaagtgtg aattagaaaa 
23821 aaaaactatt tctccttgtt taaactttgg 
23881 aaatctcttg agtaaatagt gaatatctga 
23941 agtttttaat ttattaatgc gtttttctat 

24 001 gctcacactc tcaccaccat tcaacgtcta 
24061 cataatgaat cttctttggt taacttatcg 
24121 tatttacgcg cattatgtga cgataaatct 
24181 ccactagtta aaacttcata tactatagtt 
24241 attataaact ccttttaaac actgctgaaa 
24301 tactttaatt ctttaatcca catatattta 
24361 cttaaagtta agattgcttt tttcatgtca 
24421 cgctaaatat acgttattaa tcacaataca 
24481 gaagcgactt tgatatcatc atacttcgga 
24541 catatatcta cacgcttgat aagacttact 
24601 tctttaatag aatcttcttt cttaataaaa 
24661 attgaatcac cattaactaa aatacaaaaa 
24721 aaaaatactt cttcatgcaa tatgtcatca 
24781 gcaccacatg caatatacga tactagttta 
24841 ttattctgtt catctaattg ctcatttgca 
24901 gtgagttgag aaaatatgtt attgattttt 
24961 tcaggaactc gataagaatc tacatcatac 
25021 gttttagata ataagaataa tttatgttgg 
25081 gataagtgac tttttgacat tttaatattc 
25141 agaatatcta cttgacgcaa gttcctatct 
25201 ttttgcattg gtaatgcctc cttgaaattc 
25261 caaaagttca acttttttaa ctttttgtgt 
25321 ttatagttca aatgtttgaa cttaggaggt 
253 81 ttttcgttat tgaacggtaa gatagtcgaa 
25441 gctttaggtg tatcagaaag aactttgtct 
25501 acaacagaca ttattaaagc ttgtaagtta 
25561 tattttttta aacagaaagt tcaaatgttt 
25621 tgcaagaacg agaaaaggtt aataaaagta 
25681 ttaggacaaa ttgaagctta cgacaaaacg 
25741 tacaacaaac acctaagcat gaacaacgaa 
25801 gatgaaatta ctaaaaagct acgaagtgct 
25861 ttagacaagc ttaacggtgt tgaaagcgat 
25921 aaagtgaatc agtaacattc acttcttaat 
25981 cagatgtgag cgagagctgg cgatgatatg 
26041 gcgataaccg tctgctgaat gtgggtgttg 
26101 ttacaaacat ttaattttaa agagctacca 
26161 tattttgtag gaaaagatat tgctgagatt 
26221 agaaatcatg ttgatagcga ggacaagctg 
26281 agaaatatga tcattatcaa cgaatcagga 
26341 caaagcaaaa acgaaaaaat tagagaaacc 
26401 gatgtcctac cagctattcg caaacacggt 
26461 acattaaaag atccagacta catcattaca 
26521 caaaacttac ttttacaaca gcaagtagaa 
26581 tcggtagctg gtagtgataa ttcaatactt 
26641 aacggtgttg atataggaca aaacagattg 
26701 attaaaaaga gtggagaaag ttataactta 
26761 ttggatatca aaaaacgaat aattaataat 
26821 ccaaaagtaa caggcaaagg acaacaatac 
26881 acatcttaaa aggaggaaca caatggaaca 
26941 aattatagca aaagaagtta gagaggctat 
27001 aattttcagt aaagtaagaa tcaataatga 
27061 tttcgcaaaa gatttgtcgc taggaagatt 
27121 aaagtatcag catggcttcg aatcaattca 
27181 ccatattaga aaattaacat tatcaatttt 
27241 aagtgaatac aacctagcag caaaagttta 
27301 ctatgaaaag agagtttcag aattaactat 
27361 atgttacaaa aatttagaat tgcgaaagaa 
27421 catgctagtt actgtttaga aagaaacaac 
27481 ttgttgaaaa aggttagcta aattcaacgg 
27541 tttgagatcc aacaaacaca taagttttag 
27601 cttttgtaac agtttcaatt ccttcatatc 
27661 catgtttttt tgaaccatcc tttaaagtaa 
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tatatcctcc gtattagaat catttttatt 
ttctttatta acgtttatac cgaaatctac 
ataaaaacct ttatggtttt tttcaccttc 
atctaacttt ttaaattttg gatttccaga 
attatgcgtc atcatttctc ctttattctc 
cacttgtagg cgttttttga ttagtaaaat 
ccatctattt tttgtgaaat aaattccaag 
ttaggtaact cataagtgaa tggttgatta 
tcttttttta ttttgcaatt agttattttc 
tagacgtctt tttcaaataa gcatgattaa 
aaagtgaggt agtaggtaat aaatataaga 
atttctcctt tgtttatatt tatattaaag 
actttgccca ttactttaat atcactaaac 
tttagagata ccaaattaat atagtcttcg 
ccatctaata caacgagtgc aattgtacca 
gcgtatgttc cttgttttaa cataggttcc 
tcagcatttg atggcgtttc gtcttcttta 
tataattctt ctcctatgcc agcaccagtt 
gactctttat attcatctat agaagtgact 
tagttaagta cgttttcttg gcggggaggt 
gacattatcg tttcatcttg acgttcttcg 
cccataagcc acgcttcacc gacatttaaa 
tctggagaag accttccatt aacatactgg 
aattcttttt gaaagggttt cgacttttct 
ttcataattt gttttaatct ttcagaagtg 
attatatagg aagggaaata aaaatcaata 
tgacattgtt caaaattggg gttatagtta 
gattatttga atactaatac aacttttgat 
gtgtactcga cacaatttaa ctttgctata 
ttgaagttga acaacaaagt accatggaaa 
ttgggaatac ctataaaaga tgttcacaaa 
gaacttaata agtaaaggag gcataacaca 
acacatcttc aaatgaagca tcaaaacctt 
cttaaagaaa taaagtacac tcgagacctt 
gacgcattcg ctggtttgga aatggtagag 
atcaaagagt tccaaaaagt agtgaaagcg 
aacaaagtta ctgatttaac agagtggcgg 
ataaccacgc ttatcaacat ccacattgag 
agccgcgttt aaatacattc gatagtcatt 
aggaaaaagg aggatactca aatgcaagca 
gtaagaacag tagaaattga aaacgaacct 
ttaggatatg caagatcaaa caatgccatt 
acgcaccaat ttagtgcatc aggtcaaaac 
ttatacagtc taatcttcga tgcttctaaa 
gctagaaaat tcaaacgctg ggtaacatca 
atatacgcaa cagacaatgt aattgaacaa 
gtgttgactg agtataagaa agaaaaagag 
gttaacaaac caaaagtatt attcgctgac 
gttggagaac tagcgaaaat acttaaacaa 
ttcaaatggt taagaaataa tggatatctc 
ccaactcaaa agagtatgga tctaaaaatc 
ccagatggtt caagtaaagt atcacgtaca 
tttgttaata agtttttagg agaaaaacaa 
aatcacatta accaaagaag agttgaaaga 
aaatggcaag aaaccaatca gttcaggttc 
cgatttagaa gaaatcaata aaaaactcaa 
gaggaagctc aatcatccga ttccgctaaa 
tcaaaaagct tatgtacaag atgttcatga 
tggagtgaca cttaattcag acttgagtga 
tcgagaaatc aaaaactatt atttatacat 
cgatgatttc gaataaagga ggaacaacaa 
aaaaataaat taaaactcaa attactcaag 
aaccctgaac tgttgcgagc agttgcagag 
taaggatttg ccctgcctcc acacttagag 
tagggtctag aaaaaatgtt tcgatttcct 
ctggaaaaac aattttcttt aaatccgaaa 
ctagaagttt catacttatc acctccttag 
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27721 gttgataaca acattataca cgaaaggagc 

27781 tcgaacatcg gagaaatgtt caatattcaa 

27841 ggtcgagaac ttcatcaagc attagaagtt 

27901 atgcttaaat acggatttga agaaaataca 

27961 acagctcaag gcaatatgac tcactatatt 

28021 gaaatcgcaa tgattcaacg tagtgaacct 

28081 gttgaaaaag catggaacag cccagaaatg 

28141 aacacaatca atcaattaga aacaaagatt 

28201 gatgcagtag ctactactaa gacatcaatt 

28261 caaaacggta taaacatcgg gcaacgcaga 

28321 cttattaaac gcaagggtgt ggattataac 

28381 ttattcgaaa ttaaagaaac atcaatcaca 

28441 acgccaaaag taacaggtaa aggacaacaa 

28501 caaacaactt aataggagga attacaaatg 

28561 acaatggcag ttgtgacgtg gaaggtttgg 

28621 attagtagca gggcgttgag tgactatcta 

28681 gctgaaaatt ctactgaatc tgctcgtcgc 

28741 aaataacaac attatacacg aaaggaaaga 

28801 caccagaaaa cacatataga ggcgaagaaa 

28861 cacaaatcca tcaattgttt ggagtatgta 

28921 accgcaaaga taatttaggt gtagaaaatt 

28981 tgattaatat ttctaaattg gaagagtatt 

29041 aggatattaa atgagcaaca tttataaaag 

29101 cttagcgatt gtacttatgc cgtttctata 

29161 cgcaagtatc gcaacattca tgtactacaa 

29221 ctacttgttg gagcaagtaa cagtatcaaa 

29281 aacgaaaaac ggaggaagtc aagatgtatt 

29341 ttcatgttaa cggattcgat tttaagctat 

29401 tacaagttaa agatatgaac aacgtaccaa 

29461 acttagatat ggcatcagac ttatttaacc 

29521 cagacgaaca ggacagacta attaacttag 

29581 agactgtaac ttatatcatt cgtcataggg 

29641 ctgataacaa ttcagatatt agttactcca 

29701 gtatggaaga agcgagtatc aatatggatt 

29761 aaactattga gtacgaggag gtagaacatg 

29821 gtaagcatac tcaaaaaact aaagataaat 

29881 tataaatttg cagtatacgg aaaaattggc 

29941 aaagacgctt tcgtcattga cattaacgaa 

30001 gacgtagaaa tcgagaacta tcaacacttt 

30061 ttacaggaga tgagagaaaa cggacaagaa 

30121 aaacttagag atatgacatt gaatgatgtg 

30181 aatgattggg gagaagttgc tgaacgaatt 

30241 caagaagaat acaaattcca ctttgttatt 

30301 gatgatgaag gtagcactat caaccctact 

30361 aaagctatta cttctcaaag tgatgtgtta 

30421 aacggagaaa agaaagctag atatattcta 

30481 aagattagac attcaccttc aataacaatt 

30541 acggacgtag tagaagcaat tagaaatgga 

30601 aattatgaaa atcacaggac aagcgcaatt 

30661 taacggctca gcagggtttc aagctggaga 

30721 caatgataga gaaaatagat atttcacaat 

30781 taaacataat caatttgtac cgccgtataa 

30841 attagttact cgattaggta ttaagttaaa 

30901 tcttattggt aagttttgtc acttggtatt 

30961 gtattttacg gatttttcat ttattaaacc 

31021 acctattccg aagacagata agcaaaaagc 

31081 atcaatgtct caacaaagca atccatttga 

31141 agatttagcg ttttaaggtg tggtttaaat 

312 01 cgacggtact tattccgtcg ttgctactgg 

31261 actagaaaac ggatatccac taaaagcaga 

31321 tatagaacaa cgcaaaaaaa tattcgcaat 

31381 accagtagaa tcaactagaa aattattaca 

31441 agaaatcagt ctgcgcgact gttctatgaa 

31501 agcgtttatg tttcatcatc aaatacctat 

31561 agataaagcg ttattatatt gggctacaat 

31621 tcacgcagac ctggcacatt atgaagcagt 

31681 ccactatgac aaacatgtat tagcgttatg 

SD-63311.4 



ataaacaata tgcaagcatt acaaacaaat 
gaaaaagaaa atggagaaat cgcaatcagc 
aagacagcat ataaagattg gtttccaaga 
gattacacag ctatcgctca aaaaagagca 
gaccacgcac tcacactaga cactgcaaaa 
ggcaaacgtg caagacaata tttcatccaa 
attatgcaac gtgctttaaa aattgctaac 
gcacgtgaca aaccaaaaat tgtatttgca 
ttagttggag agttagcaaa gatcattaaa 
ttgtttgagt ggttacgtca aaacggattc 
atgcctacac agtattcaat ggaacgtgag 
cattcggacg gtcacacatc aattagtaag 
tactttgtta acaagttttt aggagaaaaa 
aacgcactat acaaaacaac cctcctcatc 
aagattgaga agcacactag aaaacctgtg 
aacaacaaat ctttaaccat accgaaagat 
cttttgaagt tcgccgaaca aactattagc 
tagaaatgcc aaaaatcata gtaccaccaa 
aatttgtgaa aaagttatac gcaacaccta 
gaagtacagt atacaactgg ttgaaatatt 
tatacattga ttattcacca acaggcactc 
tgatcagaaa gcataaaaaa tggtattagg 
ctacctagta gcagtattat gcttcacagt 
cttcactaca gcatggtcaa ttgcgggatt 
agaatgcttt ttcaaagaat aaaaaaactg 
cacttaagaa aaaattcatg ttcaatataa 
acgaaatagg cgaaatcata cgcaaaaata 
tcattttaaa aggtcatatg ggcatatcaa 
ttaaacatgc ttatgtcgta gatgagaatg 
aagcaataga tgaatggatt gaagagaaca 
tcatgaaatg gtaggaggtc gctatgaagc 
atatgccaat ttatataact aacaaaccaa 
caaatagaaa tagagctagg gagtttaacg 
atcacaaagc aatcaagaaa acagtgacag 
actgaggaaa aacaagaacc acaagaaaaa 
aatatcgctg agaaaaataa aaggaaattc 
tcaggaaaaa ccacgtttgc tacaagagat 
ggtggaacaa cggttactga cgaaggatca 
gtttatgttg taaatttttt acctcaaatt 
atcaatgttg tagttattga aactattcaa 
atgaaaaata agtctaaaaa accaacgttt 
gtcagtatgt acagattaat aggaaaactt 
acaggtcatg aaggtatcaa caaagataaa 
atcactattg aagcgcaaga acaaattaaa 
gctagggcaa tgattgaaga atttgatgat 
aacgctgaac cttctaatac gtttgaaaca 
aacaataaga aatttgcaaa tcctagcatt 
aactaaaaat taattaaaag gacggtattt 
tactaaagaa acaaatcaag aaaagtttta 
attcacagtg aaagttaaaa atattgaatt 
cgtatttgaa aatgatgaag gcaaacaata 
atatgatttc caagaaaaac aattgattga 
tcttcctagc ttagattttg ataccaatga 
gaaatggaaa ttcaatgaag atgaaggtaa 
ttacaaaaag ggcgatgatg ttgttaacaa 
tgaagaaaat aacggggcac aacaacaaac 
aagcagtggc caatttggat atgacgacca 
gcaatacatt acaagatacc agaaagataa 
tgttgaactt gaacaaagtc acattgactt 
agtagaggtt ccggacaata aaaaactatc 
gtgtagagat atagaacttc actggggcga 
aacagaattg gaaattatga aaggttatga 
agttgcaagg gagttaatag aactgattat 
gagtgtagaa acgagtaagt tgttaagcga 
caaccgcaac tgtgtaatat gcggaaagcc 
cggcagaggc atgaacagaa acaaaatgaa 
tcgcgaacat cacaacgagc aacatgcgat 
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31741 tggcgttaag tcgtttgatg ataaatacca cttgcatgac tcgtggataa aagttgatga 

31801 gaggctcaat aaaatgttga aaggagagaa aaaggaatga atagactaag aataataaaa 

31861 atagcactcc taatcgtcat cttggcggaa gagattagaa atgctatgca tgctgtaaaa 

31921 gtggagaaaa ttttaaaatc tccgtttagt taatacaggt ttttacaaaa gctttaccat 

31981 aggcggacaa actaattgag ccttttttga tgtctattac ccaggggctg taatgtaact 

32041 ttaatacttc aaattcaatg ccagaaagtt tacttattgt ttctaggttg tgtcctgact 

32101 ttaacattct tttaacaaat tctaatcccg aaacaaatct ttgtttttct ataatcttat 

32161 taaagtgatt taaaaactga ggagcataaa acttattata aattcctttt tttgttaagt 

32221 aagacatgtc aaaagtttca tttaaaaccc ctaaccttac taggttatta attgaaattt 

32281 cggttgattc tatatctaac ggagagtctt ttattaacgt gtccgatata ttcataccgt 

32341 cattctttgg gtttaaaacc gctctatatt taacggcagg atgtacttcg tgattcttta 

32401 aatgttttaa aagaatagca tcatttgggg ataattgttt aattatttca acaaatgaat 

32461 ggtgggttaa tgagtttttt ctgtcatcca tagatgatgc tattagtttt gcgaacatat 

32521 tacttaaagt tttttcacta atgtaaaact ttgaagcttc tagagcagga cctagaagag 

32581 aaaattgtgg ttcttgtaaa ttatttttag gtacagaaga tatttctttt ttaaattgtt 

32641 ctttgaattt ttcaaattct acttctcttt gataaataac tttatccaca taaaggtgga 

32701 atttcccaaa gacaagttcc caagttttag agaatgtttc tacaggccct tttgatgcgc 

32761 cttcaataat tttatcaata cctttaccta aaataggatc cataattatt cacccccaat 

32821 ctaacgcaat agcgataata aaattatacc agaaaggaga atcaacatga ctgaccaacc 

32881 aagttactac tcaataatta cagcaaatgt cagatacgat aaccgactta ctgacagcga 

32941 aaagttactt tttgcagaaa taacatcttt aagtaacaaa tacggatact gcacagcaag 

33001 taatggttac tttgcaactt tatacaacgt tgttaaggaa actatatctc gtagaatttc 

33061 gaaccttacc aactttggtt atctaaaaat cgaaattatc aaagaaggta atgaagttaa 

33121 acaaaggaag atgtacccct tgacgcaaac gtcaatacct attgacgcaa aaatcaatac 

33181 ccctattgat aattctgtca atacccctat tgacgcaaat gtcaaagaga atattacaag 

33241 tattaataat acaagtaata acaatataaa tagaatagat atattgtcgg gcaacccgac 

33301 agcatcttct ataccctata aagaaattat cgattactta aacaaaaaag cgggcaagca 

33361 ttttaaacac aatacagcta aaacaaaaga ttttattaaa gcaagatgga atcaagattt 

33421 taggttggag gattttaaaa aggtgattga tatcaaaaca gctgagtggc taaacacgga 

33481 tagcgataaa taccttagac cagaaacact ttttggcagt aaatttgagg ggtacctcaa 

33541 tcaaaaaata caaccaactg gcacggatca attggaacgc atgaagtacg acgaaagtta 

33 601 ttgggattag ggggatatta tgaaaccact attcagcgaa aagataaacg aaagcttgaa 

33661 aaaatatcaa cctactcatg tcgaaaaagg attgaaatgt gagagatgtg gaagtgaata 

33721 cgacttatat aagtttgctc ctactaaaaa acacccgaat ggttacgagt ataaagacgg 

33781 ttgcaaatgt gaaatctatg aggaatataa gcgaaacaag caacggaaga taaacaacat 

33841 attcaatcaa tcaaacgtta atccgtcttt aagagatgca acagtcaaaa actacaagcc 

33901 acaaaatgaa aaacaagtac acgctaaaca aacagcaata gagtacgtac aaggcttctc 

33961 tacaaaagaa ccaaaatcat taatattgca aggttcatac ggaactggta aaagccacct 

34021 agcatacgct atcgcaaaag cagtcaaagc taaagggcat acggttgctt ttatgcacat 

34081 accaatgttg atggatcgta tcaaagcgac atacaacaaa aatgcagtag agactacaga 

34141 cgagctagtc agattgctaa gtgatattga tttacttgta ctagatgata tgggtgtaga 

34201 aaacacagag cacactttaa ataaactttt cagcattgtt gataacagag taggtaaaaa 

34261 caacatcttt acaactaact ttagtgataa agaactaaat caaaatatga actggcaacg 

34321 tataaattcg agaatgaaaa aaagagcaag aaaagtaaga gtaatcggag acgatttcag 

34381 ggagcgagat gcatggtaac caaagaattt ttaaaaacta aacttgagtg ttcagatatg 

34441 tacgctcaga aactcataga tgaggcacag ggcgatgaaa ataggttgta cgacctattt 

34501 atccaaaaac ttgcagaacg tcatacacgc cccgctatcg tcgaatatta aggagtgtta 

34561 aaaatgccga aagaaaaata ttacttatac cgagaagatg gcacagaaga tattaaggtc 

34621 atcaagtata aagacaacgt aaatgaggtt tattcgctca caggagccca tttcagcgac 

34681 gaaaagaaaa ttatgactga tagtgaccta aaacgattca aaggcgctca cgggcttcta 

34741 tatgagcaag aattaggttt acaagcaacg atatttgata tttagaggtg gacgatgagt 

34801 aaatacaacg ctaagaaagt tgagtacaaa ggaattgtat ttgatagcaa agtagagtgt 

34861 gaatattacc aatatttaga aagtaatatg aatggcacta attatgatca tatcgaaata 

34921 caaccgaaat tcgaattatt accaaaacta gataaacaac gaaagattga atatattgca 

34981 gacttcgcgt tatatctcga tggcaaactg attgaagtta tcgacattaa aggtatgcca 

35041 accgaagtag caaaacttaa agctaagatt ttcagacata aatacagaaa cataaaactc 

35101 aattggatat gtaaagcgcc taagtataca ggtaaaacat ggattacgta cgaggaatta 

3 5161 attaaagcaa gacgagaacg caaaagagaa atgaagtgat ctaatgcaac aacaagcata 

35221 tataaatgca acgattgata taaggatacc tacagaagtt gaatatcagc attttgatga 

35281 tgtggataaa gaaaaagaag cgctggcaga ttacttatat aacaatcctg acgaaatact 

35341 agagtatgac aatttaaaaa ttagaaacgt aaatgtagag gtggaataaa tgggcagtgt 

3 5401 tgtaatcatt aataataaac catataaatt taacaatttt gaaaaaagaa ataatggcaa 

35461 agcgtgggat aaatgctgga attgtttcta aacgtgttag aggttgttgg gagttttcag 

3 5521 aagctttaga cgcgccttat ggcatgcacc taaaagaata tagagaaatg aaacaaatgg 

35581 aaaagattaa acaagcgaga ctcgaacgtg aattggaaag agagcgaaag aaagaggctg 

35641 agctacgtaa gaagaagcca catttgttta atgtacctca aaaacattca cgtgatccgt 

35701 actggttcga tgtcacttat aaccaaatgt tcaagaaatg gagtgaagca taatgagcat 
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35761 aatcagtaac agaaaagtag atatgaacaa 

35821 ttacacatac ggcgacattg aaattataga 

35881 accacaatta gcattcgcaa taggtaatgc 

35941 gaatggtcat gaggatttag caaaggcgaa 

36001 ggagtgatga ccatgacaga tagcggacgt 

36061 aagagatatc tgtatcagga taacgaacga 

36121 tattactttc acggtcatat cgtgccaggt 

36181 gcggaagagc ttgaaacata tataaagcaa 

36241 ctaactttat tttaaaaggg cggaaacaat 

36301 acctgaactt atccaatggg cttgggataa 

36361 ttcaaatgat gttgagcgca actgttttgt 

36421 tgtgactgga tatgtatcaa ttaacgataa 

36481 aaaatcaaag ttaaaaaaga aatgagatta 

36541 ccggatctat cacaaggaaa aatatttttt 

36601 tttcatccaa atacaaataa gtgttcgacg 

36661 atagttgata ttgaaaaaga agtaacggaa 

36721 ttcgagattc aagaaggaga ctataactct 

36781 tgtttatatg gcagatgtgt gcctaccaaa 

36841 atgacgttaa tctggaaaga tggggagttg 

36901 ataaagataa aaaagttatg agtattattg 

36961 tgatttcaac aggttataaa agtttcaatg 

37021 aagatgtgca cggtgtggag atttatgaag 

37081 aagtaagttt tatcgagttt aaagaaggag 

37141 aattactaag tgaaaatgac gatattattg 

37201 tgctattgga ggttatgaga tgacgttcac 

37261 tactaactct aacaagttat tagataaact 

37321 caagaagcaa cgagatgagc ttattgggga 

37381 tctagagaag aaagcaagcg catgggatag 

37441 aaacgaattc ggtaacgatg atgaaagagt 

37501 ttttatggag gatgacacaa atgaataatc 

37561 ctagtgcgta taacggtaat gacacagagg 

37621 agaaagcgca agcgtttgat gaaatacttg 

37681 ttaaagaagg tattgaactt gatgaagcag 

37741 aatatgagga ggaataggaa aatgactaac 

37801 gctagaatgc ccgaacgaaa tcataagacg 

37861 actgtcgtac tcgaaccaca agaaaaagca 

37921 ccagagggct atgtcggact attaactagt 

37981 gtgattgaaa caggcaagat agacgcggga 

38041 aatgatgaag aacgtgatgg aatacccttt 

38101 gatggattaa taagcatttt agatataaaa 

38161 agaagagttt accaaatcaa caaaggcgat 

38221 tggacaccgg aactaaagca agtggaggaa 

38281 ggcttcggaa gtagcggagt gtaaagacat 

38341 gtgacgcaat acttagtcac aacattcaaa 

38401 actgtggcta gagataatca gacgtttaca 

38461 aaagagaagt acgaggcaca agttaaaaga 

38521 gaaaatataa gggagtgtgg gaaatgacgg 

38581 tttattttgt aaaaacagct gaaccttttg 

38641 atggttatat ttacgcaagt actataatca 

38701 ttgaatcaat cacacttatt gaggagcatg 

38761 tgacggtagt agtttgatat tacatgaaga 

38821 ggacaatttt agaaatgatg atgactattt 

38881 tgtattgaac aaaggttata tagttgggat 

3 8941 tacctaaaat gaaattcccg aaaaagtaca 

3 9001 cacctgaaga aaaggctaag attgaagatg 

3 9061 gtgaatttta cagtcctacg atggctaata 

3 9121 gaatgatgcc tagtttaatt gatactggag 

3 9181 atggatgggt tcgacatctt tattgttgga 

39241 ctacttgtta tcacattgcc tatctataca 

3 9301 caaggaacta ttacagataa atataacaag 

393 61 gtattagaca acaaacaagt cattgaaaat 

39421 agcgcagata tacaagctag gttaaaagta 

39481 tatagaatac actttttaaa tttatatccg 

3 9541 caatgattaa acaaatacta agactattat 

3 9601 atgtaactga gcaagtgtat attatgatga 

39661 attacgtctt tcgagcggag gtgagtgaat 

39721 tttgctgttt gctttcttaa tatccatata 
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aacgcaagac aacgttaagc aacctgcgca 
ttttattgaa caagttacgg cacagtaccc 
aattaaatac ttgtctagag caccgttaaa 
gttttacgtc gatagagtat ttgacttgtg 
aaagaatact taaaacattt tttcggctct 
gtggcacata tccatgtagt aaatggcact 
tggcaaggtg tgaaaaagac atttgataca 
agtgatttgg aatatgagga acagaagcaa 
gaaaatcaaa attgaaaaag aaatgaattt 
ccccaagtta tcaggtaata aaagattcta 
gacttttcat gttgatagca tcttatgtaa 
atttactgtt caagaggaga tataacaatg 
gatgaattaa ttaaatgggc gcgagaaaat 
tcaacaggat ttagtgatgg attcgttcgt 
tcaagtttta ttccaattga tatccccttc 
gagactaagg ttgataggtt gattgaatta 
acactatatg agaacactag tataaaagaa 
gcattctaca tcttaaacga tgacctaact 
ctagtatgat gttgaaattt aaagcttggg 
acgaaatcga ttttaatagt gggtacattt 
aagtaaaact attacaatac acaggattta 
gggatattgt tcaagattgt tattcgagag 
ccttttatat aacttttagc aatgtaactg 
aaattgttgg aaatattttt gaaaatgaga 
cttatcagat gaacaatata aaaatctttg 
tcacaaagca ttaaaagatc gtgaagagta 
tatagcgaag ttacgagatt gtaacaaaga 
gtattgcaag agcgttgaaa aagatttaat 
taaattcgga atggaattaa acaataaaat 
gcgaaaaaat cgaacagtcc gttattagtg 
ggttgctaaa agagattgag gacgtgtata 
agggaatgac aaatgctatt caacattcag 
tagggattat ggcaggtcaa gttgtctata 
acattacaag taaaactatt atcaaaaaat 
gatgcaggtt atgacatatt ctcagctgaa 
gtgatcaaaa cagatgtagc tgtgagtata 
cgtagtggtg taagtagtaa aacgtattta 
tatcatggca atttagggat taatatcaag 
ttatatgatg atatagacgc tgaattagaa 
ggtaactatg tacaagatgg aagaggcata 
aaactagctc aattggttat cgtgcctata 
ttcgaaagtg tttcagaacg tggagcaaaa 
cttagatcga gttaaggagg ttttggggaa 
gattcaacag gacgaccaca tgaacatatt 
gttattgagg cagagagtaa agaagaagcg 
gatgcagtta ttaaagtggg tcagttgtat 
atgttaaaat taaaactatt tcaggtggag 
aaaaatatgt tgaaagaatg acgagtttta 
agaaaccaac gtatattaaa acagatacga 
ggaaatgaat cagctgagaa ttttattaca 
tgaattattt aacgaaatag tatttgtttt 
aacgatagaa aaagattatg gcagagaact 
caatgttgag gaggcagatg atgattaaca 
ctgaaataat caaaaaatat aaaaataaag 
attttattaa agaaattaaa gataaagaca 
tgaatgaata tgaattaagg gctatgttaa 
atgacaatga tgattaaaaa acttaaaaat 
atactgtcat tattcggtat attcgcattg 
gtggctagtt accaacacaa agaattacat 
agacaagata aagaagacaa gttctatatt 
tccgacttat tattcaaaaa gaaatttgat 
ggcgataagg tagaagttaa aacaatcggt 
gtcttatacg aagtaaagaa ggtagataaa 
tcttactagc aatgtatgag ttaggtaagt 
cggctaatga tgatgtagag gcgccgagtg 
aatgagaata tttatttatg atttgatcgt 
tattattgat gatggagtga taataaatgc 
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39781 attaggaatt tttggtatgt ataaaattat agattccttt tcagaaaata ttataaagag 

39841 gtagataaaa atgaacgagc aaataatagg aagcatatat actttagcag gaggtgttgt 

39901 gctttattca gttaaagaga tttttaggta ttttacagat tctaacttac aacgtaaaaa 

39961 aatcaattta gaacaaatat atccgatata tttagattgt tttaaaaagg ctaaaaagat 

40021 gattggagct tatattattc caacagaaca gcatgaattt ttagattttt ttgatattga 

40081 agtctttaat aatttagata agcaaagtaa aaaagcgtat gaaaatgtta ttggatttag 

40141 acaaatgatt aatttatcaa atagagttaa ggcaatggaa gattttaaga tgagtttcaa 

40201 caatgaattt agtacaaatc agattttttt taatccttct tttgttatgg aaacaattgc 

40261 tattataaat gaatatcaaa aagatatatc ttatttaaaa aatataatta ataaaatgaa 

40321 tgaaaataga gcttataatc atattgatag ttttatcact tcagagtacc gacgaaaaat 

40381 aaacgattat aatctttatc ttgataaatt tgaagaacag tttagtcaaa agtttaaaat 

40441 aaacagaact tcgataaaag aaagaattat tattaattta aacaagagga gatttaaatg 

40501 atgtggatta ctatgactat tgtatttgct atattgctat tagtttgtat cagtattaat 

40561 agtgatcgtg caagagagat acaagcactt agatatatga atgattatct acttgatgaa 

40621 gtagttaaaa ctaaagggta caacgggtta gaagaataca ggattgaatt gaagcgaatg 

40681 aataacgata ttaaaaagta atttatatta tcggaggtat tgcattgaat gataaagatt 

40741 gagaaacacg atatcaaaaa gcttgaagaa tacattcagc acatcgataa ctatcgaaga 

40801 gagttgaaga tgcgagaata tgaattactt gaaagtcatg aaccagataa tgcgggagct 

40861 ggcaaaagta atttgccggg taacccgatt gaacgatgtg caataaagaa gtttagtgat 

40921 aacaggtaca atacattaag aaatatagtt aacggtgtag atagattgat aggtgaaagt 

40981 gatgaggata cgcttgagtt attaaggttt agatattggg attgtcctat tggttgttat 

41041 gaatgggaag atatagcaca ttactttggt acaagtaaga caagtatatt acgtagaagg 

41101 aatgcactga tcgataagtt agcaaagtat attggttatg tgtagcggac ttttacccta 

41161 tgtaagtccg cattaaaaca gtttattatg ttagtatcag attaatattt aaagttatta 

41221 aatgctaata cgacgcatga acaagaggcg catcactatg tgatgtgtct ttttatttat 

41281 gaggtatgaa catgttcaaa ctaattgtaa atacattact acacatcaag tatagatgag 

41341 tcttgatact acttaagtta tataaggtga aacattatga tgactaaaga cgaacgtata 

41401 cgattctata agtctaaaga atggcaaata acaagaaaaa gagtgctaga aagagataat 

41461 tatgaatgtc aacaatgtaa gagagacggc aagttaacga catatgacaa aagcaagcgt 

41521 aagtcgttgg atgtagatca tatattatcg ctagaacatc atccggagtt tgctcatgac 

41581 ttaaacaatt tagaaacact gtgtattaaa tgtcacaaca aaaaagaaaa gagatttata 

41641 aaaaaagaaa ataaatggaa agacgaaaaa tggtaaatac ccccgggtca aaaaaatcaa 

41701 aagcgatc 
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Table 3 





Name 


Position 






xosuion 


1 


HHfYD T7AAC 

/ /UKr 005 


1 G^79 91A9A 

lyj /Z..Z1UZ0 


Ho 


/ /VJlvrUjZ 


1 /0Z..ZUI j 


2 


7/UKrUUo 


f 1 CkC 

jy /o.jlyo 


AQ 


/ /UKtUjj 


77^91 ^77^7 


o 
3 


/ /UKrOO/ 


O 1 0^71 03 A7£ 

218/1. .230/0 


^A 
DO 


/ /UKtUj4 


77215 71HAA 
ZZo1o..ZjUOU 


A 

4 


T7A A O 

77UKi<008 


010A nA7 

2120. .330/ 


f 1 
J 1 


77r\T?TTA^^ 

/ /UKrUjj 


1 / j40.. 1 / /Oo 


5 


O^/TD 17 A AO 


3 iy4o..5zo0j 


DZ 


/ /UJNJr'UJO 


1SRQ7 1Q177 

looyz.. iy 1ZZ 


C 
0 


77/~vut7ai n 
/ /UKrUlU 


9AAQ9 9^220 

zouyz..zoooy 




77rkl?T7A^Q 


j L tD\JH..JH I OJ 


7 
1 


/ /UKrUl 1 


0AAA1 9^902 
Z444 1 ,.ZjZUo 


<\A 


1 1 wivr UO*+ 


zy j /H..Ly / yj 


o 
0 


77rYDT7A1 7 

/ /UKrUIZ 


9G722 7A^7/^ 
Zy /oo..jUj /0 


C f 


770P TTflA^ 


Z0JZ0..Z0 / *40 


o, 

y 


HHf~YD T7A 1 1 

/ /UKrUli 


jjOZU..j4jyy 


jo 


I /UJKrUOO 


11AQA 97701 


1 A 
10 


77rYDT7A1 /I 

/ /UKrU14 


977£A 920 9 
Li /0U..Z0J 1Z 


0 / 


77AP TTA/iQ 
/ /UKrUOy 




1 1 


/ /UKrU15 


19Q1 /1A92 

jzy i.,4uzo 


JO 


770P Trn7fi 


jozoy.. jo^t / d 


1 O 

12 


/ /UKr016 


10C£7 H£1A 

3Zoo/.. 53610 


jy 


/ /UKTu/ 1 


Af\AQQ. A(\l(\\ 

4-U4yo..^u/ui 


1 o 
13 


T7AT) T7A 1 T 

77OKt017 


23269..23982 


/TA 

60 


/ /UKrO/Z 


jo / j j j oy jo 


14 


77ORF018 


31169.. 31840 


61 


T7AT) rA70 

77OKt073 


30y45..31 148 


1 f 

15 


T7 AH T7A 1 A 

77UKr019 


IHOCI /1A^A1 

39851. .40501 


6z 


77AT) T7A7/1 

/ /UKr0/4 


joj44.,jo /jo 


1 ZT 

16 


T7AT) T7AO A 

77UKr020 


6926.. /5 /U 


63 


/ /UKrU/j 


1 1^71 1 127A 
1 JO / J.. 1 JO /U 


17 


T7AT) T7AO 1 

77UK1<021 


1'7'7/CO lOlA/l 

3/ /62. .38304 


64 


77AD T7A77 

/ /UKrO/ / 


Zj jj /..ZjoUj 


18 


77ORF022 


30605. .31 156 


CO. 

65 


77AD T7A'7A 

77(JKrO/y 


OOAQO OQOOA 

zyooy..zyzoo 


19 


77ORF023 


26903.. 27346 


66 


77ORr080 


OfOA/l Tf lOA 

35204. .35389 


O, A 

20 


TlATl T? AO yi 

77OKr024 


1 A*7AA 111 Af\ 

10700.. 11 140 


cn 
6/ 


'7'7/™\T> T7A O f 


O/IA/^A O/fO/lO 

Z4UOU..Z4Z4Z 


21 


77ORF025 


ahah 1 ai /n 

9707. .10147 


68 


7701^x092 


3y /0o..3y8/6 


22 


77ORF026 


/1AT1A » 1 1 /If 

40729. .41 145 


/TA 

69 


77(JKr094 


IOOO/C ^OIOI 

32Zzo..3z3yj 


23 


77ORF027 


/fin /■AI f 

65 18. .6925 


*7A 

70 


77UKr096 


1 1 CC\C 1 O.H'TO. 

13606.. 13 / 15 


24 


77ORF028 


'I ylTAf Of 1 AA 

34795. .35199 


71 


77ORr098 


TfAAO 71C/; 

/092.. /256 


25 


^■7>"T/"\T\ t~i/\o A 

77ORF029 


6H7..6521 


72 


'7TADT71 AO 

77OKhl02 


OAA-C1 OAO 1 O 

29051..2y212 


26 


T7AH FAOA 

77ORF030 


36478. .36879 


73 


77AD T7 1 A/1 

77UKr 104 


j4jyj..j4jM 


27 


77ORF03 1 


39151. .39546 


/4 


/ /UKr 109 


1 Q9CO 1 2/11/1 

lozoz..lo434 


25 


T7AD T7A O O 

77UK1<032 


Tiom "2/iozr/r 
33892. .34266 


/5 


/ /UKr 1 1Z 


'JQ^/t'J "5Q/iQO 

jyj4j..jyoyz 


OA 

29 


77UKb033 


ZHCQ C\ OA 

5 758. .6120 


lb 


/ /UKr 11/ 


97K1 07^A1 

Z/jd1..Z/jU1 


O A 

30 


77ORF034 


7886. .8236 


11 


/ /UKr 118 


38390. .38550 


o 1 
31 


T7/M1 T7AO f 

77ORF035 


1 A1CO 1 Af /A 

19258. .19560 


no 

78 


'7'7/'YDTri OA 

77UKT 120 


36059. .36199 


o ^ 
32 


*T"7/"VT* FA1 Z' 

77ORF036 


36876..37223 


*7A 

79 


nnrvn i? 1 oyi 
77UKT124 


33699. .33833 


o o 
33 


T7AFj T7AO T 

77ORF037 


1 AO A A £ 

102.. 446 


OA 

80 


0*0* T7 1 O O 

77UKT128 


1 /IOO 1 1 yl^CC 

14Z21.. 14355 


34 


77ORF038 


^> A f\r\ O O rA 1 pi 

34908. .35219 


0 1 
81 


77ORF130 


1 CCHC 1 f OA/C 

15675. .15806 


35 


^T^7/^T» T'AO A 

77ORF039 


37220. .37528 


82 


77URF133 


0/11/1 Of/IO 

8414. .8542 


36 


77ORF040 


41377. .41676 


83 


77UKr 140 


10110 1 OOOf 

131 13. .13235 


O T 

37 


T7AT) T7 A /I 1 

77ORF041 


Of AC A IfTf? 

35454..35753 


84 


77UKr 14/ 


7A7 A 7 1 /I C 

/ozy.. / i4o 


o o 

38 


T7AT* T7 A /1 1 

77ORF042 


f AC\f\ C"1H A 

5490. .5774 


Of 

85 


7/UKb 149 


JUOOO..JU /o / 


39 


T7ATi T7 A /I O 

77ORF043 


29304..29564 


86 


77UKT 151 


3183 /..3iy53 


40 


77ORF044 


18481. .18768 


87 


770RF155 


0 Ano oaoai 

30278. .30391 


41 


77ORF045 


5216..5500 


88 


770RF157 


4044..4157 


42 


77ORF046 


25663..25935 


89 


770RF167 


20692..20799 


43 


77ORF047 


11159..11425 


90 


770RF175 


35717..35821 


44 


77ORF048 


28776..29039 


91 


770RF176 


6836..6940 


45 


77ORF049 


36013..36255 


92 


770RF178 


35390..35491 


46 


77ORF050 


35753. .36007 


93 


770RF179 


8318..8419 


47 


77ORF051 


38931..39167 


94 


770RF182 


29268.-29564 
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Table 4 

77ORF017 sequence 

23982 atgacgcataatatagaaaaacgcattaataaattaaaaacttct 

1 MTHNIEKRINKLKTS 

23937 ggaaatccaaaatttaaaaagttagattcagatattcactattta 

16 GNPKFKKLDSDIHYL 

23892 ctcaagagatttgaaggtgaaaaaaaccataaaggtttttatcca 

31 LKRFEGEKNHKGFYP 

23847 aagtttaaacaaggagaaatagtttttgtagatttcggtataaac 

46 KFKQGEIVFVDFGIN 

23802 gttaataaagaattttctaattcacactttgcaatagtgatgaat 

61 VNKEFSNSHFAIVMN 

23757 aaaaatgattctaatacggaggatatagtaaatgttattccctta 

76 KNDSNTEDIVNVI PL 

23712 tcctctaaagaaaacaaaaagtatttaaagatgaattttgatttg 

91 SSKENKKYLKMNFDL 

23667 aaatgggagtattatttaagattgtttttaaatttaattagcgcg 

106 KWEYYLRLFLNLI SA 

23622 caaaataattcagctatattaaaagaagttttcgataaaaaatac 

121 QNNSAILKEVFDKKY 

23577 caaaaaaacaacacagaattcatcactaaagattattttattgaa 

136 QKNNTEFITKDYFIE 

23532 tttatatctgatagtttagaaattgaaaataaattaaataaaatt 

151 FISDSLEIENKLNKI 

23487 gacagaaacattaataacatagtatcagcaattgataaggtaaaa 

166 DRNINNIVSAIDKVK 

23442 aaattaaaaggtaatagttacgcttgcataaattctttccagccg 

181 KLKGNSYACINSFQP 

23397 attagtaagtttcgcataagaaaagttttaccccaaaaaattaaa 

196 ISKFRIRKVLPQKIK 

23352 aatccagtaatagattcttcggatattatgttactgataaataga 

211 NPVIDSSDIMLLINR 

23307 attaataataatatattgcagatccctgatataagatga 23269 

226 INNNILQIPDIR* 
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Physico-chemical parameters of ORF 77ORF017 

1 MTHNIEKRIN KLKTSGNPKF KKLDSDIHYL LKRFEGEKNH KGFYPKFKQG 

EIVFVDFGIN 

61 VNKEFSNSHF AIVMNKNDSN TEDIVNVIPL SSKENKKYLK MNFDLKWEYY 

LRLFLNLISA 

121 QNNSAILKEV FDKKYQKNNT EFITKDYFIE FISDSLEIEN KLNKIDRNIN 

NIVSAIDKVK 

181 KLKGNSYACI NSFQPISKFR IRKVLPQKIK NPVIDSSDIM LLINRINNNI LQIPDIR 



Number of amino acids: 

Average molecular weight (Daltoiis): 

87.38 

Mean amino acid weight (Daltons): 

67 

Monoisotopic molecular weight (Daltons): 

69.83 

Mean amino acid monoisotopic weight (Daltons): 

59 

Amino acid composition 



237 
278 

117. 

278 

117. 



Aci 
d 


Symbo 
1 


Numb 
er 


% 


Average % 
in Swissprot 


Aci 
d 


Symbo 
1 


Numb 
er 


% 


Average % , 
in Swissprot 


Ala 


A 


5 


2.11% 


7.58% 


Cys 


C 


1 


0.42% 


1.66% 


Asp 


D 


14 


5.91% 


5.28% 


Glu 


E 


13 J 


5.49% 


6.37% 


Phe ! 


F 


16 


6.75% 


4.09% 


Gly i 


G 


6 


2.53% 


6.84% 


His : 


H 


4 | 


1.69% 


2.24% 


He 


I 


29 


12.24 

% 


5.81% 


Lys 1 


K 


33 


13.92 

% i 


5.95% j 


Leu | 


L 


19 | 


8.02% 


9.42% 


Met | 


M 


4 


1.69% | 


2.37% 


Asn 


N 


30 1 


12.66 ' 

% 


4.45% 


Pro 


P i 


7 j 


2.95% | 


4.9% 


Gin 


Q 


6 


2.53% ) 


3.97% 


Arg 


R 


8 1 


3.38% ! 


5.16% 


Ser 


s 


17 


7.17% 


7.12% 


Thr , 


X_. ! 


5 


2.11% I 


5.67% 


Val ] 


V 


11 


4.64% ; 


6.58% 


Tip 


w | 


1 0.42% 


1.23% 


Tyr j 


Y 


8 


3.38% 


3.18% 



Number of acidic (negative) amino acids (ED): 
Number of basic (positive) amino acids (KR): 
Total charge (KRED): 



27 

11.39% 
41 

17.30% 
68 

28.69% 
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Net charge (KR - ED): 


14 




5.91% 


Theontical pi: 


10.01 


Total linear charge density: 


0.30 


Average nyaropnobicity: 


-5.37 


Ratio of hydrophilicity to hydrophobicity: 


1.41 


Percentage of hydrophilic amino acid: 


57.81% 


Percentage of hydrophobic amino acid: 


42.19% 


Ratio of %hydrophilic to %hydrophobic: 


1.37 
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77ORF019 sequence 

39851 atgaacgagcaaataataggaagcatatatactttagcaggaggt 

1 MNEQIIGSIYTLAGG 

39896 gttgtgctttatt cagttaaagagatttttaggtat t t tacagat 

16 VVLYSVKEIFRYFTD 

39941 tctaacttacaacgtaaaaaaatcaatttagaacaaatatatccg 

31 SNLQRKKINLEQIYP 

39986 atatatttagattgttttaaaaaggctaaaaagatgattggagct 

46 IYLDCFKKAKKMIGA 

40031 tatattattccaacagaacagcatgaatttttagatttttttgat 

61 YIIPTEQHEFLDFFD 

40076 attgaagtctttaataatttagataagcaaagtaaaaaagcgtat 

76 IEVFNNLDKQSKKAY 

40121 gaaaatgttattggatttagacaaatgattaatttatcaaataga 

91 EMVIGFRQMINLSNR 

40166 gttaaggcaatggaagattttaagatgagtttcaacaatgaattt 

106 VKAMEDFKMS FNNEF 

40211 agtacaaatcagattttttttaatccttcttttgttatggaaaca 

121 STNQIFFNPSFVMET 

40256 attgctattataaatgaatatcaaaaagatatatcttatttaaaa 

136 IAIINEYQKDISYLK 

4 0301 aatataattaataaaatgaatgaaaatagagcttataatcatatt 

151 NI INKMNENRAYNHI 

40346 gatagttttatcacttcagagtaccgacgaaaaataaacgattat 

166 DSFITSEYRRKINDY 

40391 aatctttatcttgataaatttgaagaacagtttagtcaaaagttt 

181 NLYLDKFEEQFSQKF 

40436 aaaataaacagaacttcgataaaagaaagaattattattaattta 

196 KINRTSIKERI I INL 

40481 aacaagaggagatttaaatga 40501 

211 N K R R F K * 
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Physico-chemical parameters of ORF 77ORF019 

1 MNEQIIGSIY TLAGGWLYS VKEIFRYFTD SNLQRKKINL EQIYPIYLDC 

FKKAKKMIGA 

61 YIIPTEQHEF LDFFDIEVFN NLDKQSKKAY ENVIGFRQMI NLSfJRVKAME 

DFKMSFNNEF 

121 STNQIFFNPS FVMETIAIIN EYQKDISYLK NIINKMNENR AYNHIDSFIT 

SEYRRKINDY 

181 NLYLDKFEEQ FSQKFKINRT SIKERIIINL NKRRFK 

Number of amino acids: 216 
Average molecular weight (Daltons): 

260 

26.06 

Mean amino acid weight (Daltons): 

120. 

49 

Monoisotopic molecular weight (Daltons): 

260 

0934 

Mean amino acid monoisotopic weight (Daltons): 

120. 

41 



Amino acid composition 



Aci 
d 


Symboi 

1 i 


Numb 1 
er 


% 


Average % 
in Swissprot 


Aci ■ 
d 


Symbo 

1 ' 


Numb j 
er 


% ; 


Average % \ 
in Swissprot 


Ala 


A | 


7 


3.24% ; 


7.58% 


Cys..j 


C 


1 ! 


0.46% 


1.66% 1 


A sp \ 


D 1 


10 


4.63% ! 


5-28% . _ _ j 


Glu 1 


E 1 


16 j 


7.41% j 


6.37% ! 


Phe ; 


F 


19 


8.80% 1 


4.09% 


Gly 


G 


5 


2.31% 


6.84% j 


His ' 


H 


2 


0.93% ! 


2.24% j 


lie 


I 


28 j 


12.96 

% ' 


5.81% I 


Lys 


K \ 


22 


10.19 I 

% s 


5.95% j 


Leu { 


L 1 


12 ] 


5.56% i 


9.42% j 


Met ] 


M 


7 


3.24% \ 


2.37% 


Asn 1 


N 


23 


io.65 : 

% i 


4.45% ] 


Pro 


p ! 


3 ' 


1.39% 


4.9% 


Gin ; 


Q 


10 


4.63% : 


3.97% ! 


Arg 


R 


11 i 


5.09% ! 


5.16% 


Ser 


s 


13 


6.02% 


7.12% 


Thr i 


T 5 


1 


3.24% 


5.67% 


Val 


V 


7 


3.24% 


6.58% 


Ik.. 


W 


0 , 


0.00% i 


1.23% 


Tyr 


Y 


13 


6.02% 1 


3.18% 



Number of acidic (negative) amino acids (ED): 


26 




12.04% 


Number of basic (positive) amino acids (KR): 


33 




15.28% 


Total charge (KRED): 


59 




27.31% 


Net charge (KR- ED): 


7 




3.24% 
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Theoritical pi: 


9.52 


Total linear charge density: 


0.28 


Average hydrophobicity: 


-4.84 


Ratio of hydrophilicity to hydrophobicity: 


1.37 


Percentage of hydrophilic amino acid: 


54.17% 


Percentage of hydrophobic amino acid: 


45.83% 


Ratio of %hydrophilic to %hydrophobic: 


1.18 
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77ORF043 sequence 



29304 atgtattacgaaataggcgaaatcatacgcaaaaatattcatgtt 

1 MYYEIGEIIRKNIHV 

2 9349 aacggattcgattttaagctattcattttaaaaggtcatatgggc 
16 NGFDFKLFILKGHMG 

2 93 94 atatcaatacaagt taaagatatgaacaacgtaccaatt aaacat 

31 I S IQVKDMNNVPIKH 

2 9439 gcttatgtcgtagatgagaatgacttagatatggcatcagactta 

46 AYVVDENDLDMASDL 

29484 tttaaccaagcaatagatgaatggattgaagagaacacagacgaa 

61 FNQAIDEWIEENTDE 

2 9529 caggacagactaattaacttagtcatgaaatggtag 29564 

76 QDRLINLVMKW* 
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Physico-chemical parameters of ORF 77ORF043 

1 MYYEIGEIIR KNIHVNGFDF KLFILKGHMG ISIQVKDMNN VPIKHAYWD 

ENDLDMASDL 

61 FNQAIDEWIE ENTDEQDRLI NLVMKW 



Number of amino acids: 

Average molecular weight (Daltons): 

86.68 

Mean amino acid weight (Daltons): 

45 

Monoisotopic molecular weight (Daltons): 
80.02 

Mean amino acid monoisotopic weight (Daltons): 

37 

Amino acid composition 



86 
101 

118. 

101 

118. 



Aci 
d 


Symbo 
1 


Numb 
er 


% 


Average % 
in Swissprot_ 


Aci 
d 


Symbo 
1 


Numb ; 
er 


% 1 


Average % j 
in Swissprot j 


Ala ; 


A 


3 ! 


3.49% 


7.58% 


Cys . 


C 1 


0 


0.00% j 


1.66% 


Asp 1 


D 


9 


10.47 1 

% 1 


5.28% 


Glu j 


E 


7 


8.14% j 


6.37% 


Phe 


F 


4 


4.65% 


4.09% 


Gly 


G 


4 


4.65% 


6.84% 


His 


H 1 


3 


3.49% 


2.24% 


He I 


I 


11 


12.79 
% 


5.81% j 


Lys 


K 1 


6 


6.98% 


5.95% 


Leu j 


L 


6 


6.98% 


9.42% 


Met ! 


M 


5 


5.81% 


2.37% 


Asn i 


N 


8 


9.30% 


4.45% 


Pro 
Arg 


P 


1 


1.16% 


4.9% 


Gin 


Q 


3 


3.49% 


3.97% 


R 


2 


2.33% 


5.16% 


Ser 


s 


2 


2.33% 


7.12% 


Thr 


T 


1 


1.16% 


5.67% 


Val 


V 


6 


6.98% 


6.58% 


Trp 


W 


2 


2.33% 


1.23% 


Tyr 


Y 


3 


3.49% 


3.18% 



Number of acidic (negative) amino acids (ED): 

Number of basic (positive) amino acids (KR): 

Total charge (KRED): 

Net charge (KR- ED): 

Theoritical pi: 

Total linear charge density: 

Average hydrophobicity: 

Ratio of hydrophilicity to hydrophobicity: 



16 

18.60% 
8 

9.30% 
24 

27.91% 
-8 

-9.30% 

4.38 

0.30 

-2.80 

1.19 
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Percentage of hydrophilic amino acid: 48.84% 
Percentage of hydrophobic amino acid : 51.16% 
Ratio of %hydrophi!ic to %hydrophobic: 0.95 
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77ORF102 sequence 



29051 




atgagcaacatttataaaagctacctagtagcagtattatgcttc 


1 M 


S 


NIYKSYLVAVLCF 


29096 




acagtcttagcgattgtacttatgccgtttctatacttcactaca 


16 T 


V 


LAIVLMPFLYFTT 


29141 




gcatggtcaattgcgggattcgcaagtatcgcaacattcatgtac 


31 A 


W 


SIAGFASIATFMY 


29186 




tacaaagaatgctttttcaaagaataa 29212 


46 Y 


K 


E C F F K E * 
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Physico-chemical parameters of ORF 77ORF102 

1 MSNIYKSYLV AVLCFTVLAI VLMPFLYFTT AWSIAGFASI ATFMYYKECF FKE 

Numb er of amino acids : 53 
Average molecular weight (Daltons): 

55.42 

Mean amino acid weight (Daltons): 

11 

6.14 

Monoisotopic molecular weight (Daltons): 

61 

51.07 

Mean amino acid monoisotopic weight (Daltons): 

11 

6.06 



Amino acid composition 



Aci 
d 


Symbo 
1 


Numb 
er 


% | 


Average % 
in Swissprot 


Aci ! 
d 


Symbo 
1 


Numb 
er 


% | 


Average % 
in Swissprot | 


Ala 


A 


6 i 


11.32 

% 


7.58% ! 


Cys i 


C 


2 


3.77 ; 
% 


1.66% 


Asp ' 


D 


0 j 


0.00% ! 


5.28% | 


Glu | 


E 


2 


3.77 I 
% 


6.37% | 


Phe i 


F 


7 | 


13.21 

% 


4.09% 


Gly 


G 


1 


1.89 ! 

% ) 


6.84% 


His 


H ! 


0 


0.00% i 


2.24% 


lie 


I 


4 


7.55 ! 

% j 


5.81% 


Lys 


K 


3 


5.66% 1 


5.95% 


Leu . 


L 


5 


9.43 | 

% ! 


9.42% 


Met 


M 


3 


5.66% 


2.37% 


Asn 


N 


1 


1.89 1 
% 


4.45% 


Pro 


P 


1 


1.89% 


4.9% 


Gin 


Q 


0 j 


0.00 

% 


3.97% 


Arg 


R 


0 


0.00% 


5.16% 


Ser 


S 


4 


7.55 

% 


7.12% ! 


Thr 


T 


4 


7.55% 


5.67% 


Val 


V 


4 


7.55 

% 


6.58% ) 


Trp 


W 


1 


1.89% 


1.23% 


Tyr 


Y 


5 


9.43 

% 


3.18% 1 



Number of acidic (negative) amino acids (ED): 

Number of basic (positive) amino acids (KR): 

Total charge (KRED): 

Net charge (KR- ED): 

Theoritical pi: 

Total linear charge density: 



2 3.77% 

3 5.66% 
5 9.43% 
1 1.89% 
8.18 

0.13 
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Average hydrophobicity : 10.81 

Ratio of hydrophilicity to hydrophobicity: 0.40 

Percentage of hydrophilic amino acid: 28.30% 

Percentage of hydrophobic amino acid: 7 1 .70% 

Ratio of %hydrophilic to %hydrophobic: 0.39 



Cl 
€! 
=P 
HI 

IV 

II! 
fli 

u 

ru 

a 
m 

€ 
6 



SD-63311.4 



118 



248/037 



77ORF104 sequence 

34393 atggtaaccaaagaatttttaaaaactaaacttgagtgttcagat 
1 MVTKEFLKTKLECSD 
34438 atgtacgctcagaaactcatagatgaggcacagggcgatgaaaat 
16 MYAQKLIDEAQGDEN 
34483 aggttgtacgacctatttatccaaaaacttgcagaacgtcataca 
31 RLYDLFIQKLAERHT 
34528 cgccccgctatcgtcgaatattaa 34551 

46 RPAIVEY* 



SD-63311.4 



119 



248/037 



Physico-chemical parameters of ORF 77ORF104 

1 MVTKE FLKTK LECSDMYAQK LIDEAQGDEN RLYDLFIQKL AERHTRPAIV EY 

Number of amino acids: 52 
Average molecular weight (Daltons): 

61 

93.13 

Mean amino acid weight (Daltons): 

11 

9.10 

Monoisotopic molecular weight (Daltons): 

61 

89.12 

Mean amino acid monoisotopic weight (Daltons): 

11 

9.02 



Amino acid composition 



Aci 
d 


Symbo 

1 1 


Numb | 
er 


% 


Average % 
in Swissprot 


Aci 

d ' 


Symbo 
1 


Numb 
er 


% ; 


Average % ! 
in Swissprot j 


Ala | 


A 


4 j 


7.69 | 
% 


7.58% 


Cys ! 


C 


1 


1.92% ] 


1.66% 


Asp 


D 


4 I 


7.69 

% 


5.28% 


Glu 1 




6 


11.54 

% 


6.37% \ 


Phc 


F 


2 


3.85 1 
% 


4.09% 


Gly : 


G 


1 | 


1.92% | 


6.84% 


His 


H 


1 


1.92 

% 


2.24% 


He 


I 


3 I 


5.77% j 


5.81% j 


Lys i 


K 1 


5 


9.62 
% 


5.95% 


Leu ; 


L 


6 


11.54 

% i 


i 

9.42% j 


Met 


M 


2 


3.85 i 

% _ 


2.37% 


Asn 


N 


1 


1.92% j 


4.45% | 


Pro 


P 


1 


1.92 
% 


4.9% 


Gin 


Q 


3 


5.77% 


3.97% 


Arg 


R 


3 


5.77 

% 


5.16% 


Ser 


s 


1 


1.92% 


7.12% 


Thr 


T 


3 


5.77 

% 


5.67% 


Val 


V 


2 


3.85% 


6.58% 1 


Trp 


W 


0 


0.00 

% 


1.23% 


Tyr 


Y 


3 


5.77% 


3.18% 



Number of acidic (negative) amino acids (ED) : 1 0 

19.23% 

Number of basic (positive) amino acids (KR): 8 

15.38% 

Total charge (KRED): 1 8 

34.62% 
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Net charge (KR-ED): 


-2 


-3.85% 


Theoritical pi: 


5.03 


Total linear charge density: 


0.38 


Average hydrophobicity: 


-5.01 


Ratio of hydrophilicity to hydrophobicity: 


1.47 


Percentage of hydrophilic amino acid: 


53.85% 


Percentage of hydrophobic amino acid: 


46.15% 


Ratio of %hydrophilic to %hydrophobic: 


1.17 



a 

m 

Hi 
m 

ni 

M 

m 

a 
m 
« 
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770RF182 sequence 

29268 atgttcaatataaaacgaaaaacggaggaagtcaagatgtattac 

1 MFNI KRKTEEVKMYY 
29313 gaaataggcgaaatcatacgcaaaaatattcatgttaacggattc 
16 EIGEIIRKNIHVNGF 
29358 gattttaagctattcattttaaaaggtcatatgggcatatcaata 
31 DFKLFILKGHMGISI 

2 9403 caagttaaagatatgaacaacgtaccaattaaacatgcttatgtc 
46 QVKDMNNVP I KHAYV 
29448 gtagatgagaatgacttagatatggcatcagacttatttaaccaa 
61 VDENDLDMASDLFNQ 
29493 gcaatagatgaatggattgaagagaacacagacgaacaggacaga 
76 AIDEWIEENTDEQDR 
29538 ctaattaacttagtcatgaaatggtag 29564 

91 LINLVMKW* 
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Physico-chemical parameters of ORF 770RF182 

1 MFNIKRKTEE VKMYYEIGEI IRKNIHVNGF DFKLFILKGH MGISIQVKDM 

NNVPIKHAYV 

61 VDENDLDMAS DLFNQAIDEW IEENTDEQDR LINLVMKW 

Number of amino acids: 
Average molecular weight (Daltons): 

91.50 

Mean amino acid weight (Daltons): 

30 

Monoisotopic molecular weight (Daltons): 

83.84 

Mean amino acid monoisotopic weight (Daltons): 

22 



Amino acid composition 



Aci 
d 


Symbo 

1 1 


Numb 
er 


% 


Average % 1 
in Swissprot J 


Aci ! 
d 


Symbo 
1 


Numb , 
er 


% j 


Average % j 
in Swissprot i 


Ala 


A 


3 1 


3.06 1 

% 1 


7.58% 


Cys 


c I 


o 


0.00% I 


1.66% j 


Asp ; 


D 


9 j 


9.18 

% 


5.28% 


Glu ; 


E 


9 


9.18% 


6.37% 


Phe 1 


F | 


5 j 


5.10 

% 


4.09% 


Gly | 


G 


4 


4.08% | 


6.84% ! 


His 


H ■ 


3 


3.06 

% j 


2.24% 


He 


I 


12 


12.24 ! 

% 


5.81% ? 


Lys 1 


K 


9 ] 


9.18 I 

% 1 


5.95% 


Leu 


L 


6 


6.12% 1 


9.42% 


Met 


M 


6 | 


6.12 

% 


2.37% 


Asn 


N 


9 


9.18% ; 


4.45% j 


Pro 


P I 


1 j 


1.02 
% 


4.9% 


Gin 


Q 


3 


3.06% ] 


3.97% j 


Arg ■ 


R 


3 j 


3.06 

% 


5.16% 


Ser 


s 


2 


2.04% 


7.12% j 


Thr 


T 


2 


2.04 
% 


5.67% 


Val 


V 


7 


7.14% 


6.58% j 


Tip 


W 


2 


2.04 

% 


1.23% 


Tyr 


Y 


3 


3.06% 


3.18% I 

._J 



98 
116 

119. 

116 

119. 



Number of acidic (negative) amino acids (ED): 
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Number of basic (positive) amino acids (KR): 12 

12.24% 

Total charge (KRED): 30 

30.61% 

Net charge (KR - ED): -6 

-6.12% 

Theoritical pi: 476 
Total linear charge density: 0.33 
Average hydrophobicity: -3-89 
Ratio of hydrophilicity to hydrophobicity: 1 28 

Percentage of hydrophilic amino acid: 5 1 .02% 

Percentage of hydrophobic amino acid: 48.98% 
Ratio of %hydrophilic to %hydrophobic: 1 04 
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Table 5 



BLASTP 2.0.8 [Jan-05-1999] 

Query* sid | 100017 | lan | 77ORF017 Phage 77 ORF | 23269-23982 | -3 
(237 letters) 



Database: nr 

393,678 sequences; 120,452,765 total letters 



Sequences producing significant alignments: 
Value 

gi|4493986|emb|CAB39045.l| (AL034559) predicted using hexExon; . 
0.010 

gi|730607|sp|P23250|RPIl_YEAST NEGATIVE RAS PROTEIN REGULATOR P. 
0.053 

gi| 3097044 |eTnb|CAA75299| (Y15035) KIR [Cowpox virus] 
0.090 

gi|2146245|pir| |S73794 hypothetical protein H91_orfl80 - Mycopl . 
0.090 

gi|83910|pir| |S04682 ribosomal protein varl - yeast (Candida gl . 
0.15 

gi | 133135 | Sp | P213 58 | RMAR_CANGA MITOCHONDRIAL RIBOSOMAL PROTEIN . 
0.15 

gi|2128843 |pir| |H64475 hypothetical protein MJ1409 - Methanococ . 
0.20 

gi|5107017|gb|AAD39926.l|AF126285_2 (AF126285) RNA polymerase [. 
0.35 

gi|2146210|pir| |S73342 hypothetical protein E07_orfl66 - Mycopl. 
0.60 

Database: swissprot 

79,449 sequences; 28,874,452 total letters 



Score 
(bits) 



41 
38 
38 
38 
37 
37 
36 
36 
35 



Sequences producing significant alignments: 

sp|P23250 RPI1_YEAST NEGATIVE RAS PROTEIN REGULATOR PROTEIN. 

Sp|P21358 RMAR__CANGA MITOCHONDRIAL RIBOSOMAL PROTEIN VAR1 . 

Sp|Q21444 LDLC_CAEEL LDLC PROTEIN HOMOLOG. 

sp|P27240 RFAY_ECOLI LIPOPOLYSACCHARIDE CORE BIOSYNTHESIS PROT. 

sp|P53192 YGC0_YEAST HYPOTHETICAL 27.1 KD PROTEIN IN ALK1-CKB1. 

sp|P32908 SMC1_YEAST CHROMOSOME SEGREGATION PROTEIN SMC1 (DA-B. 

sp|P54683 TAGB_DICDI PRESTALK- SPECIFIC PROTEIN TAGB PRECURSOR . 

sp|Q03100 CYAA_DICDI ADENYLATE CYCLASE, AGGREGATION SPECIFIC (. 



Score E 

(bits) Value 

38 0.014 

37 0.040 

34 0.35 

33 0.46 

33 0.60 

33 0.60 

32 0.78 

32 0.78 
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BLASTP 2.0.8 [Jan-05-1999] 

Query= sid| 100019 | lan | 77ORF019 Phage 77 ORF | 39851-40501 | 2 
(216 letters) 

Database: nr 

373,355 sequences; 114,214,446 total letters 



Sequences producing significant alignments: 



Score 
(bits) 



gi | 3341966 
gi|2689911 
gi | 1171589 
gi|4493986 
gij 141257 | 
gi | 133412 j 
gi|3122231 
gi|3649757 
gi|2688313 



|dbj |BAA31932| (AB009866) orf 59 [bacteriophage phi PVL] 
(AE000792) B. burgdorferi predicted coding region BB. 

|emb|CAA64574| (X95275) frameshift [Plasmodium falcip. 

|emb|CAB39045.l| (AL034559) predicted using hexExon; . 

sp|P18019|YPI9_CLOPE HYPOTHETICAL 14.5 KD PROTEIN (OR. 

sp|P27059|RPOB__ASTLO DNA- DIRECTED RNA POLYMERASE BETA. 

|sp|Q5885l|HISX_METJA HISTIDINOL DEHYDROGENASE (HDH) . 

|emb|CAB11106.l| (Z98547) predicted using hexExon; MA. 
(AE001146) sensory transduction histidine kinase, pu. 



437 
38 
37 
36 
36 
35 
35 
34 
34 



E 

Value 

3-122 
0.058 
0 .10 
0.23 
0.29 
0.51 
0.51 
0.66 
0.87 



Database: swissprot 

79,449 sequences; 28,874,452 total letters 



Sequences producing significant alignments: 



sp 
sp 
sp 
sp 
sp 
sp 
sp 
sp 
sp 
sp 



P18019 
Q58851 
P27059 
Q02224 
P04931 
P18011 
P18709 
Q64409 
P21358 
Q03945 



YPI9__CLOPE 

HISX_METJA 

RPOB_ASTLO 

CENE_HUMAN 

ARP_PLAFA 

IPAB_SHIFL 

VTA2_XENLA 

CP3H_CAVPO 

RMAR_CANGA 

IPAB SHIDY 



HYPOTHETICAL 14.5 KD PROTEIN (ORF9) . 
HISTIDINOL DEHYDROGENASE (EC 1.1.1.23) (H. 
DNA- DIRECTED RNA POLYMERASE BETA CHAIN (E. 
CENTROMERIC PROTEIN E (CENP-E PROTEIN) . 
ASPARAGINE-RICH PROTEIN (AG319) (ARP) (FRA. 
62 KD MEMBRANE ANTIGEN. 

VITELLOGENIN A2 PRECURSOR (VTG A2) [CONTA. 
CYTOCHROME P450 3A17 (EC 1.14.14.1) (CYPI . 
MITOCHONDRIAL RIBOSOMAL PROTEIN VAR1 . 
62 KD MEMBRANE ANTIGEN. 



Score 




E 


(bits) 




Value 


36 


0 


079 


35 


0 


14 


35 


0 


14 


34 


0 


.31 


33 


0 


.53 


32 


0 


.69 


32 


0 


.90 


32 


0 


.90 


32 


0 


.90 


32 


1 


.2 
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BLASTP 2.0.8 [Jan-05-1999] 

Query= sid| 100043 | lan | 77ORF043 Phage 77 ORF | 29304-29564 | 3 
(86 letters) 

Database: nr 

373,355 sequences; 114,214,446 total letters 



Sequences producing significant alignments: 



Score E 
(bits) Value 



gi | 3341947 |dbj |BAA31913| (AB009866) orf 39 [bacteriophage phi PVL] 182 
gi|744518|prf | |2014422A FKBP-rapamycin-associated protein [Homo. ^° 
gi| 1169736 |sp|P42346|FRAP_RAT FKBP-RAPAMYCIN ASSOCIATED PROTEIN. 
gi| 1169735 |sp|P42345|FRAP_HUMAN FKBP-RAPAMYCIN ASSOCIATED PROTE . 
gi | 3282239 (U88966) rapamycin associated protein FRAP 2 [Homo sa. 
gi|3875402|emb|CAA98122| (Z73906) cDNA EST EMBL:D64 544 comes fr. 
gi| 1084792|pir| |S54091 hypothetical protein YPR070w - yeast (Sa. 

Database : swissprot 

79,449 sequences; 28,874,452 total letters 



32 
32 
32 
32 
31 
30 



6e-46 
0.84 
.84 
.84 
.84 
.5 
.2 



Sequences producing significant alignments: 

sp|P42345 FRAP_HUMAN FKBP-RAPAMYCIN ASSOCIATED PROTEIN (FRAP) . 
spjP42346 FRAP_RAT FKBP-RAPAMYCIN ASSOCIATED PROTEIN (FRAP) (R. 
sp|P34554 YNP1_CAEEL HYPOTHETICAL 42.2 KD PROTEIN T05G5.1 IN C. 
sp|Q24118 LIO_DROME LINOTTE PROTEIN. 

sp|P80034 ACH2_BOMMO ANTICHYMOTRYPSIN II (ACHY- II) . 

spjP22922 A1ATJBOMMO ANTITRYPSIN PRECURSOR (AT). 

sp|Q44363 TRAA_AGRT6 CONJUGAL TRANSFER PROTEIN TRAA. 

sp|P38255 YBU5_YEAST HYPOTHETICAL 51.3 KD PROTEIN IN PH05-VPS1. 

sp|P55822 SH3B_HUMAN SH3BGR PROTEIN (21-GLUTAMIC ACID-RICH PRO. 

sp|Q58482 YA82_METJA HYPOTHETICAL PROTEIN MJ1082. 

sp|P34252 YKK8_YEAST HYPOTHETICAL 52.3 KD PROTEIN IN HAP4-AAT1, 



Score 


E 


(bits) 


Value 


32 


0.24 


32 


0.24 


28 


3.5 


28 


3.5 


28 


3 .5 


28 


3.5 


28 


3 .5 


27 


6.0 


27 


7.9 


27 


7.9 


27 


7.9 
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BLASTP 2.0.8 [ Jan-05-1999] 

Query= sid | 100102 | lan | 77ORF102 Phage 77 ORF | 29051-29212 ) 2 
(53 letters) 

Database: nr 

373,355 sequences; 114,214,446 total letters 

Score E 

Sequences producing significant alignments: (bits) Value 

gi | 3341946 | dbj |BAA31912| (AB009866) orf 38 [bacteriophage phi PVL] 96 3e-20 
gij 4325288 |gb|AAD17315| (AF123593) voltage- dependent sodium cha. . . 28 7.1 
gi|2649684 (AE001040) a. fulgidus predicted coding region AF092... 28 9.3 

Database: swissprot 

79,449 sequences; 28,874,452 total letters 

Score E 

Sequences producing significant alignments: (bits) Value 

sp|P42087 HUTMJ3ACSU PUTATIVE HISTIDINE PERMEASE. 26 7.1 

sp|P04775 CIN2_RAT SODIUM CHANNEL PROTEIN, BRAIN II ALPHA SUBU... 26 9.2 
sp|P42619 YQJF_ECOLI HYPOTHETICAL 17.2 KD PROTEIN IN EXUR-TDCC... 26 9.2 
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BLASTP 2.0.8 [ Jan-05-1999] 

Query= sid | 100104 | lan | 77ORF104 Phage 77 ORF | 343 93-34551 | 1 
(52 letters) 

Database: nr 

373,355 sequences; 114,214,446 total letters 



Sequences producing significant alignments: 



Score 
(bits) 



gi | 2315523 (AF016452) similar to the leucine-rich domains found... 29 
gi|4377168|gb|AAD18990| (AE001666) CT711 hypothetical protein [... 29 
gi | 3882171 | dbj |BAA34445| (AB018268) KIAA0725 protein [Homo sapi . . . 28 



E 

Value 

4.2 
5.4 
9.3 



Database: swissprot 

79,449 sequences; 28,874,452 total letters 



Score E 

Sequences producing significant alignments: (bits) Value 

sp|P04879 RRPP_VSVIG RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7.48. 27 5.4 

sp|P04880 RRPP_VSVIM RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7.48. 27 5.4 

sp|Q13946 CN7A_HUMAN HIGH- AFFINITY CAMP -SPECIFIC 3', 5 '-CYCLIC . 26 7.1 

sp|P35381 ATPA_DROME ATP SYNTHASE ALPHA CHAIN, MITOCHONDRIAL P. 26 9.3 

sp|P54659 MVPB_DICDI MAJOR VAULT PROTEIN BETA { MVP- BETA) . 26 9.3 

sp|P40397 YHXC_BACSU HYPOTHETICAL OXIDOREDUCTASE IN APRE-COMK . 26 9.3 



SD-63311.4 



129 



248/037 



BLASTP 2.0.8 [Jan-05-1999] 

Query= sid| 122748 | lan| 770RF182 Phage 77 ORF | 29268-29564 [ 3 
{98 letters) 



Database : nr 



393,678 sequences; 120,452,765 total letters 



Sequences producing significant alignments: 



9 1 

gi 
gi 
gi 
gi 
gi 
gi 



3341947 1 dbj BAA31913.l| (AB009866) orf 39 [bacteriophage phi. 
1084792 | pir |S54091 hypothetical protein YPR070w - yeast (Sa. 
1169736 |sp|P42346|FRAP_RAT FKB P - RAPAMYC I N ASSOCIATED PROTEIN. 
744518 |prf | | 2014422A FKBP-rapamyc in- associated protein [Homo. 
505138l|emb|CAB44736.l| (AL049653) dJ647M16.2 (FK506 binding. 
4826730 jref |NP_004949 . 1 |pFRAPl | FK506 binding protein 12 -rap. 
3282239 (U88966) rapamycin associated protein FRAP 2 [Homo sa. 



Score 
(bits) 

. 182 
35 
32 
32 
32 
32 
32 



Database: swissprot 

79,909 sequences; 29,054,478 total letters 



E 

Value 

8e-46 

0.13 

1.1 

1.1 

1.1 

1.1 

1.1 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



sp|P42345 FRAP_HUMAN FKB P- RAPAMYCIN ASSOCIATED PROTEIN (FRAP) . 32 0.29 

sp|P42346 FRAPJRAT FKB P -RAPAMYCIN ASSOCIATED PROTEIN (FRAP) (R. 32 0.29 

sp|P40557 YIA5_YEAST PUTATIVE DISULFIDE ISOMERASE YIL005W PREC. 29 3.3 

sp|Q24118 LIO_DROME LINOTTE PROTEIN. 28 4.4 

sp|Q44363 TRAA_AGRT6 CONJUGAL TRANSFER PROTEIN TRAA. 28 4.4 

Sp | P80 03 4 ACH2JBOMMO ANTICHYMOTRYPSIN II (ACHY- II ) . 28 4.4 

sp|P34554 YNP1_CAEEL HYPOTHETICAL 42.2 KD PROTEIN T05G5.1 IN C . 28 4.4 

sp|P22922 A1AT_B0MM0 ANTITRYPSIN PRECURSOR (AT). 28 4.4 
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1st 










3rd 


position 




2nd position 
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(5' end) 










(3' end) 




u 


C 


A 


f+ 






Phe 


Ser 


Tyr 


Cys 


U 


u 


Phe 


Ser 


Tyr 


Cys 


c 




Leu 


Ser 


Stop 


Stop 


A 




Leu 


Ser 


StOD 


Trp 


G 




Leu 


Pro 


His 


Arg 


U 




Leu 


Pro 


His 


Arg 


c 


C 


Leu 


Pro 


Gin 


Arg 


A 




Leu 


Pro 


Gin 


Ara 


G 




lie 


Thr 


Asn 


Ser 


U 




lie 


Thr 


Asn 


Ser 


C 


A 


lie 


Thr 


Lys 


Arg 


A 




Met 


Thr 


Lvs 


Ara 


G 




Val 


Ala 


Asp 


Gly 


U 




Val 


Ala 


Asp 


Gly 


c 


G 


Val 


Ala 


Glu 


Gly 


A 




Val 


Ala 


Glu 


Glv 


G 
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Table 7 



Bacteriophage 3A, complete genome sequence 

1 caaacgctag caacgcggat aaatttttca tgaaaggggg tctttatatg aagttaacaa aaaaacagct 

71 aaaagaatat atagaagatt acaaaaaatc tgatgacata ttaattaatt tgtatataga aacatatgaa 

141 ttttattgtc ggttaagaga tgaacttaaa aatagtgatt taatgataga gcatacaaac aaggctggtg 

211 cgagcaatat tattaagaat ccattaagca tagaactgac aaaaacagtt caaacactaa ataacttact 

281 caagtctatg ggtttaactg cagcacaaag aaaaaagata gttcaagaag aaggtggatt cggtgactat 

351 taaagtttta aatgaacctt caccaaaact attaacaaca tggtatgcag agcaagtcac tcaagggaaa 

421 ataaaaacaa gcaaatatgt tagaaaagaa tgtgagagac atcttagata tctagaaaat ggaggtaaat 

491 gggtatttga tgaagaatta gcgcatcgtc ctattcgatt tatagaaaag ttttgtaaac cttccaaagg 

561 atctaaacgt caacttgtat tacagccatg gcaacatttt attatcggca gtttgtttgg ttgggttcat 

631 aaagaaacaa aactgcgcag gtttaaagaa gctttgatat ttatggggcg aaaaaatggt aaaacaacca 

701 ctatttctgg ggttgctaac tatgctgtat cacaagatgg agaaaatggt gcagaaattc atttgttagc 

771 aaacgtaatg aaacaagcta ggattctatt tgatgaatct aaggcgatga ttaaagctag cccaaagctt 

841 gataaaaatt tcagaacatt aagagatgaa atccattatg acgcaacgat atcaaaaatt atgccccaag 

911 catcagatag cgataagtta gatggattga atacacacat ggggattttt gatgaaattc atgaatttaa 

981 agactataaa ttgatttcag ttataaaaaa ctcaagagct gcaaggttac aacctcttct catctacatt 

1051 acgacagcag ggtatcaatt agatggtcca cttgttgata tggtagaagc gggaagagac accttagatc 

1121 aaatcataga agacgaaaga actttttatt atttagcatc tttggatgat gacgatgata ttaatgattc 

1191 gtcgaactgg ataaaagcaa atcccaactt aggtgtctct ataaatttag atgagatgaa agaagagtgg 

1261 gaaaaagcta agagaacacc agctgaacgt ggagatttta taaccaaaag gtttaatatc tttgctaata 

1331 atgacgagat gagttttatt gattacccaa cactccaaaa aaataatgaa attgtttctt tagaagagct 

1401 ggaaggcaga ccgtgcacga ttggttatga tttatcagaa acagaggact ttacagccgc gtgtgctact 

1471 tttgcgttag ataatggtaa agttgcagtt ttatcgcatt catggattcc taagcacaaa gttgaatatt 

1541 ctaacgaaaa aataccctat agagaatggg aagaagatgg cttattaaca gtgcaagata agccttatat 

1611 tgactaccaa gatgttttaa attggataat taagatgaat gagcattatg tagtagaaaa aattacttat 

1681 gatagagcga acgcattcaa actaaatcaa gagttaaaaa attacgggtt tgaaacggaa gaaacaagac 

1751 aaggagcttt gaccttgagc cctgcattga aggatttaaa agaaatgttt ttagatggga aaataatatt 

1821 taataataat cctttaatga aatggtatat caataatgtt cagttgaaac tagacagaaa cggaaactgg 

1891 ttgccgtcta agcaaagcag atatcgtaaa atagatggct ttgcagcatt tttaaacaca tatacagata 

1961 ttatgaataa agttgtttct gatagtggtg aaggaaacat agagtttatt agtattaaag acataatgcg 

2031 ttaaggaggt gaatgttatc gcaaaagaga atattgtcac acgcataaag aaaaaattga tagacaattg 

2101 gattgatcag tcaacttcta agctttatga ctttagccca tggaaaaata gatctttttg gggtgtaatt 

2171 aataatacgc ttgaaactaa tgaaacgata ttttcagcta ttacaaagtt atctaattcg atggctagtt 

2241 tgcccttgaa aatgtatgaa gattataaag tagttaatac agaagtatct gatttactta cagtgtcacc 

2311 gaataattct ctgagcagtt ttgattttat taatcaaatt gaaacaatca gaaatgaaaa aggtaatgca 

2381 tatgtgctaa ttgaacgaga catctatcat caaccatcaa agcttttctt attaaatcca gatgttgttg 

2451 aaatgttaat tgaaaaccaa tcacgtgaac tttattattc cattcatgct gcaactggaa ataaattgat 

2521 tgttcataat atggacatgt tgcattttaa acacatcgtg gcatctaata tggtgcaagg cattagtccg 

2591 attgatgtgt tgaagaatac aactgatttt gataatgcag taagaacctt taatcttaca gaaatgcaaa 

2661 aacctgattc tttcatgctt aaatatggtt ccaatgtagg taaagaaaaa aggcagcaag tgttagaaga 

2731 tttcaaacag tactatgaag aaaacggtgg aatattattc caagagcctg gtgttgaaat cgaaccgtta 

2801 cctaaaaaat atgtctctga agatatagtg gcaagcgaga atttaacaag agaaagagta gctaacgttt 

2871 ttcaattgcc ctcagtattc ttaaatgcaa gatcaaatac aaatttcgcg aaaaatgaag agttaaacag 

2941 attttacttg cagcatacct tattgccaat cgtcaaacag tatgaagaag aatttaatcg gaaactactt 

3011 actaaaacag acagagaaaa aaataggtat tttaaattta acgttaaatc ttatttaagg gctgatagtg 

3081 caacacaagc agaagtgtac tttaaagcag ttcgtagtgg ttactacact ataaatgaca ttagagagtg 

3151 ggaagattta ccaccagttg aaggtggaga taagccgcta ataagcggtg atttataccc aattgacacg 

3221 ccacttgaat taagaaaatc tttgaaaggt ggtgataaaa atgtcaatga aagctaagta ttttcaaatg 

3291 aaaagaaaat caaaaagtaa aggtgaaata tttatttatg gtgatattgt aagtgataaa tggtttgaaa 

3361 gtgatgtaac tgctacagat ttcaaaaata aactagatga actaggagac atcagtgaaa tagatgttca 

3431 tataaattca tctggaggca gtgtatttga agggcatgca atatacaata tgctaaaaat gcatcctgca 

3501 aaaattaata tctatgtcga tgccttagcg gcatcaattg ctagtgttat cgctatgagt ggtgacacta 

3571 tttttatgca caaaaatagt tttttaatga ttcataattc atgggttatg actgtaggta atgcagaaga 

3641 gttaagaaag acagcggatt tacttgaaaa aacagatgct gttagtaatt cagcttattt agataaagca 

3711 aaagatttag atcaagaaca cttaaaacag atgttagatg cagaaacttg gcttactgca gaagaagcct 

3781 tgtctttcgg cttgatagat gaaattttag gagctaatga aataactgct agtatctcta aagagcaata 

3851 taagcgtttc gagaacgtcc cagaagattt aaagaaagat gtagacaaaa tcactaaaat cgatgatgta 

3921 gatacgtttg aattggttga aacacctaaa gaaagtatgt cactagaaga aaaagaaaaa agagaaaaaa 

3991 ttaaacgcga atgcgaaatt ttaaaaatga caatgagtta ttaggaggaa atgaaatgcc gacattatat 

4061 gaattaaaac aatccttagg tatgattgga caacaattaa aaaataaaaa tgatgaattg agtcagaaag 

4131 caacagaccc aaatattgat atggaagaca tcaaacaact agaaacagaa aaagcaggct tacaacaaag 

4201 atttaacatt gttgaaagac aagtaaaaga cattgaagaa aaagaaaaag cgaaagttaa agacacagga 

4271 gaagcttatc aatctttaaa tgatcatgag aagatggtta aagctaaggc agagttttat cgtcacgcga 

4341 ttttaccaaa tgaatttgaa aaaccttcaa tggaggcaca acgtttatta cacgctttac caacaggtaa 

4411 tgattcaggt ggtgataagc tcttaccaaa aacactttct aaagaaattg tttcagaacc atttgctaaa 

4481 aaccaattac gtgaaaaagc tcgtctaact aacattaaag gtttagagat tccaagagtt tcatatactt 

4551 tagacgatga tgacttcatt acagatgtag aaacagcaaa agaattaaaa ttaaaaggtg atacagttaa 

4621 attcactact aataaattca aagtatttgc tgcaatttca gatactgtaa ttcatggatc agatgtagat 

4691 ttagtaaact gggttgaaaa cgcactacaa tcaggtctag cagctaaaga acgtaaagat gccttagcag 
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4761 taagtcctaa atctggatta gatcacatgt cattttacaa tggatctgtt aaagaagttg agggagcaga 

4831 catgtatgat gctattatta acgctttagc agatttacat gaagattacc gtgataacgc aacaatttat 

4901 atgcgatatg cggattatgt caaaattatt agtgttcttt caaatggaac aacaaatttc tttgacacac 

4971 cagcagaaaa agtatttggc aaaccagtag tatttacaga tgcagcagtt aaacctattg tgggagattt 

5041 caattatttt ggaattaact atgatggaac aacttatgac actgataaag atgttaaaaa aggcgaatat 

5111 ttgtttgtat taactgcatg gtatgatcag caacgtacat tagacagtgc attcagaatt gcaaaagcaa 

5181 aagaaaatac aggttcatta cccagctaag ccccaaaagg ttaatgtaac agctaaggct aaatcagctg 

5251 taatatcagc cgaatagggg tgatgaaatg agtttagaag aaattaaatt gtggttgaga attgactata 

5321 atttcgaaaa tgatttaatt gaaggtctca ttcaatcggc taagtctgaa ttactattaa gtggggttcc 

5391 agattatgac aaagatgact tggaataccc gcttttttgt acagcgatta gatatatcat tgcaagagat 

5461 tatgaaagtc gtgggtactc aaatgaccaa tctagaagca aggtttttaa tgaaaaggga ttgcaaaaaa 

5531 tgattctgaa attaaaaaag tggtaggtga tttttaaatg gaatttaatg aatttaaaga tcgcgcatat 

5601 ttttttcaat atgtaaataa agggccgtat ccagatgaag aggaaaaaat gaagttgtat agttgctttt 

5671 gtaaaatata taatccttct atgaaagata gagaaatttt aaaagcgact gaatcaaagt caggactaac 

5741 cataattatg aggtcttcta aaattgaata tctaccacaa acaaatcact tagttaaaat tgacagaggc 

5811 ttatattccg ataaattatt caacattaaa gaaataagaa ttgatacacc agatattggc tataatacag 

5881 tggttttatc agaaaaatga gtgtagaaat taaagggata cctgaagtgt tgaagaaatt agaatcggta 

5951 tacggtaaac aatcaatgca agctaagagt gatagagctt taaatgaagc atctgaattt tttataaagg 

6021 ctttaaagaa agaattcgag agttttaaag atacgggtgc tagcatagaa gaaatgacta aatctaagcc 

6091 ttatacaaaa gtaggaagtc aagaaagagc tgttttaatt gaatgggtag gccctatgaa tcgcaaaaac 

6161 attattcact tgaatgaaca tggttataca agagatggaa aaaaatatac accaagaggt tttggagtta 

6231 ttgcaaaaac attagctgct aatgaacgga agtatagaga aattataaaa aaggagttgg ccagataaat 

6301 gaatatatta aacaccataa aagaaatttt attatctgat gcagagctcc aaacatatat aaattctaga 

6371 atatactatt ataaagtcac tgaaaatgct gaaacttcca aaccttttgt tgttattaca cctatttatg 

6441 atttaccttc agacttcatg tctgataaat atcttagtga agaatactta attcaaatag atgtagaatc 

6511 ttcaaataat cagaaaacaa ttgatataac aaaacgaata agatatctgt tatatcaaca aaatttaatt 

6581 caagcatcta gtcagttaga tgcttatttt gaagaaacta aacgttatgt gatgtcgaga cgttatcaag 

6651 gcataccaaa aaatatatat tataaaaatc agcgcatcga ataggtgtgc tttttaattt ttaaggagga 

6721 aataagcaat ggcagaagga caaggttctt ataaagtagg ttttaaaaga ttatacgttg gagtttttaa 

6791 cccagaagca acaaaagtag ttaaacgcat gacatgggaa gatgaaaaag gtggtacagt tgatctaaat 

6861 atcacaggtt tagcaccaga tttagtagat atgtttgcat ctaacaaacg tgtttggatg aaaaaacaag 

6931 gtactaatga agttaagtct gacatgagta tttttaatat tccaagtgaa gatctaaata cagttattgg 

7001 tcgttctaaa gataaaaatg gtacatcttg ggtaggagag aatacaagag caccatacgt aacagttatt 

7071 ggagaatctg aagatggttt aacaggtcaa ccagtgtacg ttgcgctact taaaggtact tttagcttgg 

7141 attcaattga atttaaaaca cgaggagaaa aagcagaagc accagagcca acaaaattaa ctggtgactg 

7211 gatgaacaga aaagttgatg ttgatggtac tccacaaggt attgtatacg ggtatcatga aggtaaagaa 

7281 ggagaagcag aattcttcaa aaaagtattc gttggataca cggacagtga agatcattca gaggattctg 

7351 caagttcgtt acccagctaa cccccaaaat gttgaagtag cagttaattc aaaatctgca acagtttcag 

7421 cagaataggg gctttcaaaa taaatcaaag gagaataatt tatgactaaa actttaaagg tttataaagg 

7491 agacgacgtc gtagcttctg aacaaggtga aggcaaagtg tcagtaactt tatctaattt agaagcggat 

7561 acaacttatc caaaaggtac ttaccaagtg gcatgggaag aaaatggtaa agaatctagt aaagttgatg 

7631 tacctcaatt caaaaccaat ccaattctag tctcaggcgt atcatttaca cccgaaacta aatcaatcac 

7701 ggtaaatgct gatgacaatg ttgaaccaaa cattgcacca agtacagcaa cgaataaaac gttgaaatat 

7771 acaagtgaac atccagagtt tgttactgtt gatgagagaa caggagcaat tcacggtgta gctgagggaa 

7841 cttcagttat cactgctacg tctactgacg gaagtgacaa gtctggacaa attacagtaa cagtaacaaa 

7911 tggataatta tttgagacgc agaatatctg cgtctttttt atttgaataa aaggagctaa tacaatgatt 

7981 aaatttgaaa ttaaagaccg taaaacagga aaaacagaga gctatacaaa agaagatgtg acaatgggcg 

8051 aagcagaaaa atgctatgag tatttagaat tagtaaatca agagaataaa aaagaagtac ctaacgcaac 

8121 aaaaatgaga caaaaagagc gacagttatt agtagattta tttaaagatg aaggattgac tgaagaagat 

8191 gttttgaaca agatgagcac taaaacttat acaaaagcct tgaaagatat atttcgagaa atcaatggtg 

8261 aagatgaaga agattcagaa actgaaccag aagagatggg aaagacagaa gaacaatctc aataaaagat 

8331 attttatcga acattaagaa aatacaacgt ttctgtatgg agcagtatgg gtggacatta actgaagtca 

8401 gaaaacagcc gtatgtaaaa cttttagaaa tacttaatga agagaataaa gaagagactg aagaaaaaca 

84 71 aagtgaacaa aaagtcatta caggtacgga tttaagaaaa ctttttggaa gctagaaagg aggttaatat 

8541 gaatgaaaaa gtagaaggca tgaccttgga gctgaaatta gaccatttag gtgtccaaga aggcatgaag 

8611 ggtttaaagc gacaattagg tgttgttaat agtgaaatga aagctaatct gtcatcattt gataagtctg 

8681 aaaaatcaat ggaaaagtat caggcgagaa ttaaggggtt aaatgataag cttaaagttc aaaaaaagat 

8751 gtattctcaa gtagaagatg agcttaaaca agttaacgct aattatcaaa aagctaaatc tagtgtaaaa 

8821 gatgttgaga aagcatattt aaagctagta gaagctaata aaaaagaaaa attagctctt gataaatcta 

8891 aagaagcctt aaaatcttcg aatacagaac ttaaaaaagc tgaaaatcaa tataaacgta caaatcaacg 

8961 taaacaagat gcatatcaaa aacttaaaca gttgagagat gcagaacaaa agcttaagaa tagtaaccaa 

9031 gctactactg cacaactaaa aagagcaagt gacgcagtac agaagcagtc cgctaagcat aaagcacttg 

9101 ttgaacaata taaacaagaa ggcaatcaag ttcaaaaact aaaagtacaa aatgataatc tttcaaaatc 

9171 aaacgaaaaa atagaaaatt cttacgctaa aactaatact aaattaaagc aaacagaaaa agaatttaat 

9241 gatttaaata atactattaa gaatcatagc gctaatgtcg caaaagctga aacagctgtt aacaaagaaa 

9311 aagctgcttt aaataattta gagcgttcaa tagataaagc ttcatccgaa atgaagactt ttaacaaaga 

9381 acaaatgata gctcaaagtc atttcggcaa acttgctagt caagcggatg tcatgtcaaa gaaatttagt 

9451 tctattggag ataaaatgac ttccctagga cgtacgatga cgatgggcgt atctacaccg attactttag 

9521 ggttaggtgc agcattaaaa acaagtgcag acttcgaagg gcaaatgtct cgagttggag cgattgcaca 

9591 agcaagcagt aaagacttaa aaagcatgtc taatcaagcg gttgacttag gcgctaaaac aagtaaaagt 

9661 gctaacgaag ttgctaaagg tatggaagaa ttggcagctt taggctttaa tgccaaacaa acaatggagg 

9731 ctatgccggg tgttatcagt gcagcagaag caagcggtgc agaaatggct acaactgcaa ctgtaatggc 

9801 atcagcaatt aattctttcg gtttaaaagc atctgatgca aaccatgttg ctgatttact tgcgagatca 

9871 gctaatgata gtgctgcaga tattcaatac atgggagatg cattaaaata tgcaggtact ccagcaaaag 

9941 cattaggagt ttcaatagag gacacttctg cagcaattga agttttatct aactcagggt tagaggggtc 
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10011 tcaagcaggt actgcattaa gagcttcgtt tattaggcta gctaatccaa gtaaaagtac agctaaggaa 

10081 atgaaaaaat taggtattca tttgtctgat gctaaaggtc aatttgttgg catgggtgaa ttgattagac 

10151 agttccaaga caacatgaaa ggcatgacga gagaacaaaa actagcaaca gtggctacaa tagttggcac 

10221 tgaagcagca agtggatttt tagccttgat tgaagcgggt ccagataaaa ttaatagcta tagcaaatca 

10291 ttgaagaact ctaatggtga aagtaaaaaa gcagctgatt tgatgaaaga caacctcaaa ggtgctctgg 

10361 aacaattagg tggcgctttt gaatcgttag caattgaagt tggtaaagat ttaacgccta tgattagagc 

10431 aggtgcggaa ggattaacaa aattagttga tggatttaca catcttcctg gttggtttag aaaggcttcg 

10501 gtaggtttag cgatttttgg tgcatctatt ggccctgctg ttcttgctgg tggcttatta atacgtgcag 

10571 ttggaagcgc ggctaaaggc tatgcatcat taaatagacg cattgctgaa aatacaatac tgtctaatac 

10641 caattcaaaa gcaatgaaat ctttaggtct tcaaacctta tttcttggtt ctacaacagg aaaaacgtca 

10711 aaaggcttta aaggattagc cggagctatg ttgtttaatt taaaacctat aaatgttttg aaaaattctg 

10781 caaagctagc aattttaccg ttcaaacttt tgaaaaacgg tttaggatta gccgcaaaat ccttatttgc 

10851 agtaagtgga ggcgcaagat ttgctggtgt agccttaaag tttttaacag gacctatagg tgctacaata 

10921 actgctatta caattgcata taaagttttt aaaaccgcat atgatcgtgt ggaatggttc agaaacggta 

10991 ttaacggttt aggagaaact ataaagtttt ttggtggcaa aattattggc ggtgctgtta ggaagctagg 

11061 agagtttaaa aattatcttg gaagtatagg caaaagcttc aaagaaaagt tttcaaagga tatgaaagat 

11131 ggttataaat ctttgagtga cgatgacctt ctgaaagtag gagtcaacaa gtttaaagga tttatgcaaa 

11201 ccatgggcac agcttctaaa aaagcatctg atactgtaaa agtgttgggg aaaggtgttt caaaagaaac 

11271 agaaaaagct ttagaaaaat acgtacacta ttctgaagag aacaacagaa tcatggaaaa agtacgttta 

11341 aactcgggtc aaataacaga agacaaagca aaaaaacttt tgaaaattga agcggattta tctaataacc 

11411 ttatagctga aatagaaaaa agaaataaaa aggaactcga aaaaactcaa gaacttattg ataagtatag 

11481 tgcgttcgat gaacaagaaa agcaaaacat tttaactaga actaaagaaa aaaatgactt gcgaattaaa 

11551 aaagagcaag aactcaatca gaaaatcaaa gaattgaaag aaaaagcttt aagtgatggt cagatttcag 

11621 aaaatgaaag aaaagaaatt gaaaagcttg aaaatcaaag acgtgacatc actgttaaag aattgagtaa 

11691 gactgaaaaa gagcaagagc gtattttagt aagaatgcaa agaaacagaa atgcttattc aatagacgaa 

11761 gcgagcaaag caattaaaga agcagaaaaa gcaagaaaag caagaaaaaa agaagtggac aagcaatatg 

11831 aagatgatgt cattgctata aaaaataacg tcaacctttc taagtctgaa aaagataaat tattagctat 

11901 tgctgatcaa agacataagg atgaagtaag aaaggcaaaa tctaaaaaag atgctgtagt agacgttgtt 

11971 aaaaagcaaa ataaagatat tgataaagag atggatttat ccagtggtcg tgtatataaa aatactgaaa 

12041 agtggtggaa tggccttaaa agttggtggt ctaacttcag agaagaccaa aagaagaaaa gtgataagta 

12111 cgctaaagaa caagaagaaa cagctcgtag aaacagagaa aatataaaga aatggtttgg aaatgcttgg 

12181 gacggcgtaa aaactaaaac tggcgaagct tttagtaaaa tgggcagaaa tgctaatcat tttggcggcg 

12251 aaatgaaaaa aatgtggagt ggaatcaaag gaattccaag caaattaagt tcaggttgga gctcagccaa 

12321 aagttctgta ggatatcaca ctaaggctat agctaatagt actggtaaat ggtttggaaa agcttggcaa 

12391 tctgttaaat cgactacagg aagtatttac aatcaaacta agcaaaagta ttcagatgcc tcagataaag 

12461 cttgggcgca ttcaaaatct atttggaaag ggacatcaaa atggtttagc aatgcatata aaagtgcaaa 

12531 gggctggcta acggatatgg ctaataaatc gcgctcgaaa tgggataata tttctagtac agcatggtcg 

12601 aatgcaaaat ccgtttggaa aggaacatcg aaatggttta gtaactcata caaatcttta aaaggttgga 

12671 ctggagatat gtattcaaga gcccacgatc gttttgatgc aatttcaagt tcggcatggt ctaacgctaa 

12741 atcagtattt aatggtttta gaaaatggct atcaagaaca tatgaatgga ttagagatat tggtaaagac 

12811 atgggaagag ctgcggctga tttaggtaaa aatgttgcta ataaagctat tggcggttta aatagcatga 

12881 ttggcggtat taataaaata tctaaagcca ttactgataa aaatctcatc aagccaatac ctacattgtc 

12951 tactggtact ttagcaggaa agggtgtagc taccgataat tcgggagcat taacgcaacc gacatttgct 

13021 gtattaaatg atagaggttc tggaaacgcc ccaggtggtg gagttcaaga agtaattcac agggctgacg 

13091 gaacattcca tgcaccccaa ggacgagatg tggttgttcc actaggagtt ggagatagtg taataaatgc 

13161 caatgacact ctgaagttac agcggatggg tgttttgcca aaattccatg gtggtacgaa aaagaaagat 

13231 tggctagacc aacttaaagg taatataggt aaaaaagcag gagaatttgg agctacagct aaaaacacag 

13301 cgcataatat caaaaaaggt gcagaagaaa tggttgaagc agcaggcgat aaaatcaaag atggtgcatc 

13371 ttggttaggc gataaaatcg gcgatgtgtg ggattacgta caacatccag ggaaactagt aaataaagta 

13441 atgtcaggtt taaatattaa ttttggaggc ggactaacgc tacagtaaaa attgctaaag gcgcgtactc 

13511 attgctcaaa aagaaattaa tagacaaagt aaaatcgtgg tttgaagatt ttggtggtgg aggcgatgga 

13581 agctatctat ttgaatatcc aatctggcaa agatttggac gctacacagg tggacttaac tttaatgacg 

13651 gtcgtcacta tggtatagac tttggtatgc ctactggaac aaacgtttat gccgttaaag gtggtatagc 

13721 agataaggta tggactgatt acggtggcgg taattctata caaattaaga ccggtgctaa cgaatggaac 

13791 tggtatatgc atttatctaa gcaattagca agacaaggcc aacgtattaa agctggtcaa ctgataggga 

13861 aatcaggtgc tacaggtaat ttcgttagag gagcacactt acatttccaa ttgatgcaag ggtcacatcc 

13931 agggaatgat acagctaaag atccagaaaa atggttgaag tcacttaaag gtagtggcgt tcgaagtggt 

14001 tcaggtgtta ataaggctgc atctgcttgg gcaggcgata tacgtcgtgc agcaaaacga atgggtgtta 

14 071 atgttacttc gggtgatgta ggaaatatca ttagcttgat tcaacacgaa tcaggaggaa atgcaggtat 

14141 aactcaatct agttcgctta gagacatcaa cgttttacag ggcaatccag caaaaggatt gcttcaatat 

14211 atcccacaaa catttagaca ttatgctgtt agaggtcaca acaatatata tagtggttac gatcagttat 

14281 tagcgttctt taacaacaga tattggcgct cacagtttaa cccaagaggt ggttggtctc caagtggtcc 

14351 aagaagatat gcgaatggtg gtttgattac aaagcatcaa cttgctgaag tgggtgaagg agataaacag 

14421 gagatggtta tccctttaac tagacgtaaa cgagcaattc aattaactga acaggttatg cgcatcatcg 

14491 gtatggatgg caagccaaat aacatcactg taaataatga tacttctaca gttgaaaaat tgttgaaaca 

14561 aattgttatg ttaagtgata aaggaaataa attaacagat gcattgattc aaactgtttc ttctcaggat 

14631 aataacttag gttctaatga tgcaattaga ggtttagaaa aaatattgtc aaaacaaagt gggcatagag 

14701 caaatgcaaa taattatatg ggaggtttga ctaattaatg caatcttttg taaaaatcat agatggttac 

14771 aaggaagaag taataacaga ttttaatcag cttatatttt tagatgcaag ggctgaaagt ccaaacacca 

14 841 atgataacag tgtaactatt aacggagtag atggtatttt accgggcgca attagttttg cgcctttttc 

14911 attagtatta aggtttggct atgatggtat agatgttata gatttaaatt tatttgagca ttggtttaga 

14981 tctgtgttta atcgcagaca tccttattat gttattactt ctcaaatgcc tggtgttaaa tatgcagtga 

15051 atacagctaa tgttacatct aatttaaaag atggttcttc aactgaaatt gaagtaagtt taaatgttta 
15121 taaagggtat tctgaatcag ttaattggac cgatagcgag ttcttattcg actctaattg gatgtttgaa 
15191 aatggaattc ctcttgattt cacacctaaa tatactcata catcaaatca atttactatt tggaacggtt 
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15261 ctactgatac gataaatcca cgattcaagc 

15331 atttgaactg gttaactata caacaggtga 

15401 tttgttttag atggtgtgta tgcatatcga 

15471 taacattagc gccaggtaaa aatgaattta 

15541 gtttcctttt atttataggt aggtgattta 

15611 aattgatttg tgaaaattta ctagatgtag 

15681 taggtacatc acttttacag tttatagaac 

15751 aacttcataa tttatcatgg tgaaaaatac 

15821 tttttattga agttacggca tatcacataa 

15891 tgatgacgac agtagcgaaa ctggtaaaac 

15961 gcaaatcaaa aaacttcggt caaaatgacc 

16031 acgaattagg taacaaaaac ggcttagaat 

16101 cccaaatgat acggagatat gtttttattc 

16171 tatcaatata atactgatac tgtatctgca 

16241 ttggaaaaaa gtatacagct gaggaaaaga 

16311 aaatggtttt ataaaagaag gtacttatcg 

16381 tgcaagtatg gtaatgaaac agttagattt 

16451 ttttagacgg caagcaaatt aagcaaattt 

16521 aataaaaaat attgataaag gcaagcacgt 

16591 attgatatat cttcaaataa aaaagctaag 

16661 atttaattgc tgacaactca ggtcgcaatc 

16731 gcagtttggg attcgatatg ctaatacgca 

16801 tttgcaaaaa agcaaataaa tgatactcct 

16871 tagagccaag agatagcgta ttctttgttc 

16941 acttgatagg tcacatccat ttgtaaacgc 

17011 gtacaaattc aacaagcgct taacagacga 

17081 gtataaatca tttatacact agtactttga 

17151 ataatggcaa cagaagaagt taaaatcaaa 

17221 attggaaagc tataaatggg ataccttatg 

17291 catttcggta gatgataaaa ataaattaga 

17361 attgtacaga aatccccaaa cggtaaattg 

17431 tgctatttta ttagaaagga aggtgcatta 

17501 aggtcgagat tatagagcta aggaaataca 

17571 acagagttta agttgcatca gaaagctaaa 

17641 gatcgcaagc agcagtaaac aaatctttac 

17711 cagtgctaat gagattgttc aagctcgagt 

17781 ttagacgatt gggaaaccca aactcaaatt 

17851 aagaaattct tgatatcgag tatcgttttg 

17921 acctcttaca aatgcagtaa tgcaatcctt 

17991 gctagaaata atggctatat gctaagtcgt 

18061 taggtggggg tcatggtaca cataacggtt 

18131 cttaaatggt aataatgaga atacattagt 

18201 aagtatggta tgcaagatgt atttacagga 

18271 aagaaaataa aattctatac agaattgaga 

18341 tatagagata agaagtttag acgatgttga 

18411 atgagactaa caaacgaaac ccaaccaatg 

18481 caggagacag taatccaaat aatagaaact 

18551 tcaggttaat gctgactatg gtggaacact 

18621 caaatatact atgacaaaga tagtggtaaa 

18691 atagaacaca tcgtattttc atgattgggc 

18761 ttttatcatg agtgacacag gtggtagagt 

18831 gggatgttaa cagagccagg tctttactat 

18901 taccaagaga atggcgtgat gcaggttggt 

18971 tcagatattg acgcgtaata gttatgcaag 

19041 ggagacattt cggactggaa ttatgtgcct 

19111 caaaaatgtc agatattaac atagtaggca 

19181 agattttcca actgaacgta aaggggtagc 

19251 tttgttcata ggctagttcg taatagtgtt 

19321 aaacaagttc agggccatgg actttacacg 

19391 gtagctataa atttagaaaa cacagcgcat 

19461 gagagagtga ttcttctgtt cttcttttta 

19531 aaatatcaaa gcacgaatag cgattcgagg 

19601 ccatttaaag gtattttaaa atttcaatta 

19671 aagtttcggt tgcagaatta ggtaattcag 

19741 aaaaagtttg tttagcatga ttccatctga 

19811 aaaactatta tggatcgtgc gaaagcaatg 

19881 ttgaaaaagc taaagaaaaa ggtctatcag 

19951 gcaacttgct aatagccata tatctgattt 

20021 caaaagcgat atatggatga gaaacatgaa 

20091 gtggttctac ttcaaattgg caaaaagcta 

20161 atttgatttt aataatccag aacaaagaat 

20231 aattatccaa gaggtgttag tactaacggt 

20301 tgacttatcg accgaacggt acaaataaag 

20371 gtcagaatta gctattaatg attacaatac 

20441 atggccgaaa gtaacgctaa attatacgca 



acgatttgaa aatattaatt aatttaaatg cgagtggagg 
tatttttaag tacaacaaaa gtatagataa aaacactgat 
gatataaata gagtgggaat tgatacaaat agaggcatta 
agattaaagg agacatcagt gatattaaaa ctacatttaa 
atggattatc atgatcattt atcagtaatg gattttaatg 
attatggttc ttttaaagaa tattatgaac tgaatgaagc 
tactcataat agttttgttt tcgatttact aatttgtgaa 
acaattaagc agacagcgcc aaaggttgaa ggtgataaag 
tgtatgaatt tcaaaatcac tcagtggaat caaataagct 
gccagaatac tctttagatg agtacttaag atatggattt 
tataaaataa ttggaaattt taagcgaaaa gtaccgattg 
actgtaaaga agcggtagac ctatttggct gtataattta 
tcctgaaaca ttttatcaaa gaagcgagaa agtgattcga 
actgtcagta cattggaatt aagaacagct ataaaagttt 
aaaattataa tcctattaga acaactgaca ttaaatattc 
taccgcaaca attgggtcta aagctactat taactttgat 
acaataaaaa agggctctca aggtggaata tataagttga 
cttgttttgc taagtcggtt cagtctgaaa caatagattt 
tttagaaatg atatttttag gagaagaccc caaaaataga 
ccttgtatgt atgttggaac tgaaaaatca acagtcttaa 
aatacaaagc aattgttgac tacgtcgcag atagtgcaaa 
aacaaatgaa gatatcgaaa cacaggataa gctgttagaa 
aagactgaat tagatgttaa ttatataggt tatgaaaaaa 
atgaattaat gggatataac actgaattaa aggttgttaa 
aatagatgaa gtgtctttca gcaatgaaat aaaggatatg 
gttattgcac aagataatag atataactat caagcaaatc 
attctccttt cgagacaatg gatataggga gtgtattaat 
gcgctacttg aaaacgataa acagtacttt ccagctacac 
caggcagtag tgatattgat ggattgcctc aagacggtat 
taatttaaaa ataggcgaag caggaattat tcaaaatagc 
tggaaaataa cagttgacga tagtgggaaa cttggtacag 
tggaaaattt gtatttaata aaggatttgg gagctttagc 
aaacttacaa agaatagagc aatttgcgct tggcttgaca 
acaattcaac acttcgctga gcaaatttat tataatggta 
aaagtcaaat taacgcactt gttgtggcac cacgtaataa 
taatgtaaac ggcgaaacct ttgacacatt aaaagaacat 
aataaagagg aaactataag agaattaaat aagaccaaac 
aacctgataa gcaagaattt ttatttgtga cagaacttgc 
ctggtttgat aatagaacag gcatagtata catgacacaa 
ctaagaccta atggtcaatt tatagacagc tcattgattg 
atagatatat tgatgatgag ttatggattt atagttttat 
tcgtttcaag tatacgccta atgtggaaat tagctatggc 
cacccagaaa aaccctacat cacccctgtc ataaatgaaa 
gacctagaag tcactgggaa cttgaaaact caatgaatta 
taaaaatatt gataaagttt tgcataaaat cagtatccct 
cagggtgtga cttttgatga aaaatacttg tattggtata 
atttaacggc tttcgattta gaaacaggag aagaagcgta 
agattcattt cctggcgaat ttgcggaagc agaaggtttg 
aaagctttga tgctaggtgt tactgtcggt ggtgatggaa 
aaagaggtat tttagaaata cttcactcaa gaggcgttcc 
taaaccttta ccaatgaggc ctgataaact taagaatctt 
ttatacactg atcatacagt tcaaatcgat gatttcccat 
tcttggaagt taagccacca caaactggcg gtgatgtaat 
gaatatgatg acttttgaaa gggtgctttc tggaagaact 
aaaaatagtg gtaaatggga gagagtacct tcattcatca 
tgtcgtttta tttaactacg gatgatacaa aacgttttac 
tggttggaac ttatatgtag aagcttcaaa cacaggtggc 
acagcatctg ctgagatact attgaaaaat tatgatagta 
aagggagaat tataagttaa tgagtaattt agagaaatct 
tatgaaaata tttcaaatct agatataact tttagaacag 
at ate act aa aaataatcaa ccgttattat tgagtgaaga 
taaaggagtc atggtagttg ctccactaga aatattagat 
cctaatgatg taattaaacg agatggaagt tatcaagctc 
acgtggtagt tgtcgagaga actatcacat ttaacgttga 
aacaaaatta cactatattg ttgaatttca ggaattagaa 
gacgaggcta taaaaaatgg tgaagattat gcgagtctga 
atattcaaat agcaaaatct tcaagtatag atgaattaaa 
ggaaaataaa gegcaagcat attcaagaac attcgatgag 
gccttcaagc agtcagtgaa tagtggtggt ttagtcacaa 
agattactaa agatgatggt aagataatgc agattactgg 
aggtgattca acccaattta tttatgtttc gcaagctata 
actgtcgaat atttagtagt aacttcagat tacaagegta 
tgtttgttaa aagaaaagaa gcgggttcat ggtctgagtg 
accttttgaa actgttcaaa gtgcccaatc aaaagctaat 
gatgacaagt ttaataaaag gtattcggtt atttttgatg 
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20511 gaacagcaaa tggtgtgggc tctacattgt 

20581 ttatgggact tttccaggtg gtgactttac 

20651 aatccctcaa atcttccaga tggtgatgga 

20721 gtcgtacatc tttaactata tcaaacgatg 

20791 cgcaaataga gggacaatta acaaaattat 

20861 atgagataat ttcatacgct atcattggtg 

20931 tttctctcaa gtttttagac ctaaagcctt 

21001 tcagaagaaa aagatgactt gcatcaacag 

21071 tcttacgaaa aatggttgct agtatgcaga 

21141 taagcaaaat gcactaatgg caaaacaact 

21211 actgaaaatg cttaaattaa tttcaccaac 

21281 agtaaagaag atatagcgtg gtatgtagat 

21351 gagaaaagta tccagaaaat ctagagtcat 

21421 tggtgtaatg tttggattta ccaaacgaca 

21491 aagactatgt ttgaaaaatt cgacagaata 

21561 tagatagaaa tttcgaagaa ctaaggcgtg 

21631 aaatattaga gacatcaaga tgtggattct 

21701 ttgttaaaaa ctatttttgg catttaaagg 

21771 ttggtcgtgt ttctggttta gtaagtgtaa 

21841 tttggaaaaa aggagcaaac aaatggatgc 

21911 gtaaatcaat tcttagcgaa caaaggtatt 

21981 tacttactgt tgttgcttta tatactacgt 

22051 tcaaaagcta aagaaatata aagctgaaaa 

22121 gtaatgacac ctacgaatat gaacgacaca 

22191 aaccaagcag aaaaatggtt tgataattca 

22261 agtgttacga ttacgcaaat atgtttttta 

22331 taatattcca tttgataata aagcaaggat 

22401 ttaccgcaaa agttggacat tgtcgttttc 

22471 ttgagagcgc taatctaaac actttcacat 

22541 cgttgcgcaa cctggttggg gtcccgaaac 

22611 tttattagat taaatttccc agataaagta 

22681 ctgccaaaaa gcaagcagta attaaaccta 

22751 tggagcagta ggaaacggaa caaacgaacg 

22821 tatttaagac atgccggtca tgaagtcgca 

22891 atacagcata cggtgttaat gtaggtaata 

22961 tgacattgtt ctagaaatac atttagacgc 

23031 agtcaattca atgcagatac tattgataaa 

23101 gaggtgtaac acctcgtaac gatttactaa 

23171 atctgaatta ggttttatca ctaataaaaa 

23241 aaattaatag ccggtgcgat tcatggtaag 

23311 ttaaaaacga aaagaatccg ccagtgccag 

23381 agaaactggt tattacacag ttgccaatgt 

23451 agaattactg gtgtattacc taataacgca 

23521 gatggattac ttatattgct aatagtggac 

23591 taatagaata agcagttttg gtaagtttag 

23661 cttcggtact tgcctattat ttaaaattaa 

23731 aaacaaacgt ttttagtata taaattattt 

23801 tcaactatat cgtggtttta tgtttattat 

23871 acgggttttt ttcgaaataa tagtaaaaaa 

23941 aaatatttaa ttttattaaa agttaaaaag 

24011 atgattttta tggtcaaaaa aagactatta 

24081 cttcgtttca tgaatctaaa gctgataaca 

24151 aacagaagat acaagtagcg ataagtgggg 

24221 aagtataaca aagacgcttt gattttaaaa 

24291 acaaaaacac agatcatata aaagcaatga 

24361 ccccaatgta gatttaataa attatctacc 

24431 ggttataaca taggtggtaa ttttaatagt 

24501 aaacaattag ttataataaa ataaaaagta 

24571 caggagtttt aggttacata ccatataaat 

24641 tatcaatact cctgtattat tgattttttc 

24711 tttaataatg ttgatttaaa aaatttgaat 

24781 ttctatttat ttttgtttta acagtgtttg 

24851 taatataacc agaaagttta tgaaattgga 

24921 aacaacggta aaccagtatt tatagttata 

24991 aaacctataa ttcagctggt agcgatttcg 

25061 tttaccgtca aacgatgaat tgtatattaa 

25131 ttatatttaa tgaatgaata ctaatctttt 

25201 tggcgcccgg cttttcaaaa cttttgttta 

25271 cgccataaaa ttctcaccac cattcaacgt 

25341 gaatcttctt tggttaactt atctccatct 

25411 gtgacgataa atctttaggt aactcataag 

25481 agtttctttt tttattttgc aattagttat 

25551 gtcttttata ttaaagcgcc acacaggcgc 

25621 ctaaacgaag cgactttgat atcatcatac 

25691 tatctacacg cttgataaga cttactccat 



acttaaatga gagtttagac caatttattt tattaatttt 
agagtttggc agtccttttg gaggaggaaa gatttcattg 
aatggtggag gtgtttatga gtttggatta actaaatcta 
tctatttcga cttaggaagt caaagaggct ctggtgcgaa 
aggagtgaga aaataatgca aatattagtt aacaagcgta 
gctttgaaga aggtattgat attgaaaatt taccagaaaa 
taaatattca aatggggaaa tagtttttaa cgaagattat 
attgacagtg aagaacaaaa cacagtcgct tctgatgaca 
aacaagttgt tcaaagtaca aagttatcga tgcaagttaa 
tgtgacactt aataaaaaat tagaagaggt taaaggagag 
attcgaagat attaaaacat ggtatcaatt gaaagaatat 
atggaagtta tagataaaga ggaatatgca attattacag 
aggttataat cttatggctt tttaatttga ataaagtggg 
cgaacaagat tggcgtttaa cgcgattaga agaaaatgat 
gaagacagtc tgagaacgca agaaaaaatt tatgacaagt 
acaaagaaga agatgaaaaa aataaagaga aaaatgctaa 
aggattaata gggacgattc taagtacatt tgttatagcc 
aggtgattac catgcttaag ggaattttag gatatagctt 
gtaatagtta agagtcagtg cttcggcact ggctttttat 
aaaagtaata acaagataca tcgtattgat cttagcatta 
agcccgattc cagtagacga tgagaatata tcatcaataa 
ataaagacaa tccaacatct caagaaggta aatgggcaaa 
caagtataga aaagcaacag ggcaagcgcc aattaaagaa 
aatgatttag ggtaggtgtt gaccaatgtt gataacaaaa 
ttagggaagc agttcaatcc tgatttgttt tatggatttc 
tgatagcaac aggcgaaagg ttacaaggtt tatacgctta 
tgaaaaatac gggcaaataa ttaaaaacta tgatagcttt 
ccgtcaaagt atggtggcgg agctggacat gttgaaattg 
cgtttggcca aaattggaat ggtaaaggtt ggacaaatgg 
cgttacaaga catgttcatt attacgatga cccaatgtat 
agtgttggag ataaagctaa aagcgttatt aagcaagcaa 
aaaaaattat gcttgtagcc ggtcatggtt ataacgatcc 
cgattttata cgtaaatata taacgccaaa tatcgctaag 
ttatatggtg gctcaagtca atcacaagac atgtatcaag 
aaaaagatta tggcttatat tgggttaaat cacaggggta 
agcaggagaa agcgcaagtg gtgggcatgt tattatctca 
agtatacaag atgttattaa aaataactta ggacaaataa 
atgttaacgt atcagcagaa ataaatataa attatcgctt 
tgatatggat tggattaaga aaaactatga cttgtattct 
cctatcggtg gtgtgatatc tagtgaggtt aaaacaccag 
caggttatac acccgataaa aataatgtac cgtataaaaa 
taaaggtaat aacgtaaggg acggctattc aactaattca 
acaatcaaat atgacggcgc atattgtatc aatggctata 
aacgtcgtta tattgctaca ggagaggtag acaaggcagg 
tgcagtttga taattgtata tgatgaatct taggcaggta 
taaacagtta atttttacat gaatatatta aattttaaaa 
tgtgttcgta ttgtgtgcta tgattaaaaa gttgttatgg 
caatcaaaat ataaattatt tataatttgt ttggtaatga 
acacatttgt agatatttta aactcggtaa atcttttaat 
gtttaatata aaaatgtaat aaaatttata aagaaaggaa 
gctgcaacat tgtcgttagg aataatcact cctattgcta 
atattgagaa tattggtgat ggcgctgagg tagtcaaaag 
ggtcacacaa aatattcagt ttgattttgt taaagataaa 
atgcaaggtt ttatcaattc aaagactact tattacaatt 
ggtggccttt ccaatacaat attggtctca aaacaaatga 
taaaaataaa atagattcag taaatgttag tcaaacatta 
ggtccatcaa caggaggtaa tggttcattt aattattcaa 
ggtgataaga tgactcaatt tctaggggcg cttcttctta 
atctaacaat gataggttta gttagtgaaa aaaacaaggt 
tattgaaaca tgtttgatat ggttttatag ttttataatt 
ttaattcagt tgcttacagg tctaaaagca aatattttgt 
tatttaatcc tttaattgtt aaatttatta tctggttaat 
ttgtataagc ttattagaca aaagagacaa gttgtttaat 
aaagactttg aaaacagaat cattgaagag ggtgaactta 
atttactaga agttgagcga caagatttca aagtatctga 
acatacactt gtagacctta aacaacaaat taaattggat 
ttcttagctt tttctgataa agtgcttttt aatttttcgc 
ttgggttact acgagtagct tcttgttttt tgtttttatc 
ctacacttgt aggcgttttt ttatttagta aagtcataat 
attttttgtg aaataaattc caagtattta cgcgcattat 
tgaatggttg attaccacta gttaaaactt catatactat 
tttcattata aacttccttt caaacactgc tgaaatagac 
tgttaatcac aatacaactt tgcccattac tttaatatta 
ttcggattta gagataccaa attaatatag tcttcgcata 
ctaatacaac gagtgcaatt gtaccatctt taatagaatc 
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25761 ttctttctta ataaaagcgt atgttccttg 

25831 caaaaatcag catttgatgg cgtttcgtct 

25901 attcttctcc tatgccagca ccagttgcac 

25971 atctatagaa gtgactttat tctgttcttc 

26041 ggaggtgtga gtttgttgta tatggaagtg 

26111 acaaatcatt aatcttcaca ttgaagtact 

26181 ttcattttcc cattttgata tcttgccttt 

26251 gttgctaatt gttccatagt catattttta 

26321 cgaaaccctc cttatataag ataatttcat 

26391 gcaaaagttg ttgacatcga aacttttatg 

26461 agggggttca atgacaacta gtgtagcaga 

26531 ggaactaacc aaaaagaagt tgctaaagca 

26601 gaattaatgg cagagatttt acaacttcag 

26671 tgattttttt taaactttaa gtttcgaaag 

26741 taacgttaac caaagaagag ttgaaagaaa 

26811 accaatcagc tcaggtgcaa ttttcagtaa 

26881 aaactcaatt tcgcaaaaga tttgtcgcta 

26951 agtatcagca tggcttcgaa tcaattcatc 

27021 attaacatta tcaatttttg gagtgacact 

27091 aaaatttata gagatatcaa aaactattat 

27161 atgatttcga atgaaggagg aactacaaat 

27231 ttaattgacg tgtggcatgg aaatcaatgg 

27301 tctcggatag agaaggtaag aaatatctaa 

27371 ctattgcctt acaatcctaa atcttttctg 

27441 aaatgctgaa atagtcacga gcaacgctat 

27511 ttcgttatga atcttatgtc tatctagagc 

27581 tctaaatcca taaatttcac ctccttccac 

27651 ggaacgacaa atgcaagctc aaaacaaaaa 

27721 ccattagata ttcaaattaa tgacggatat 

27791 aagaaatacc atacgtaaat aataacttat 

27861 ttttgagaaa gatattgaaa agctaatttc 

27931 ctcttttaac ttcgttccaa gttttattgt 

28001 gtcatcaata atccaagaaa cgaccctgcc 

28071 tctaatttta aaagtgagta cattactgtt 

28141 tatgtcgagt aagtggttca cctattttct 

28211 gtgattaagt ttcatcctat cacctccata 

28281 aggaataaca aatgaacatt caagaagcaa 

28351 agattggaaa gaaagtcatc gaactaagat 

28421 aatagcgatg ggacaaacct tatcagatat 

28491 aagttataaa cccaactaga gaccaggaat 

28561 aaattgtttt taaactcatt ttcaaagtaa 

28631 tactagcata cacgccgttt aggaacccag 

28701 atgtagtttt tgaaaatact ttgtatgtat 

28771 gaacctaact ttacacattc taaataatct 

28841 gtgtatcaaa ttcatcagat atcaagggca 

28911 aaggagcata aacaaatgaa cacaagatca 

28981 ctgatgcttc ttcatcctat ttaacggaaa 

29051 aaaaagtgat tacagctact tagaaataaa 

29121 gcgaataata acaaacttta acatttatct 

29191 caccagaaaa cacatatcga ggcgaagaaa 

29261 tcaattgttt ggagtatgta gaagtacagt 

29331 gtagaaaatt tatacattga ttattcagca 

29401 tgatcagaaa gcataaaaaa tggtattagg 

29471 agcagtgttg tgcttcacgg tcttagcgat 

29541 attgcgggat tcgcaagtat cgcaacattc 

29611 gctacttgtt ggagcaagta acagtgcaag 

29681 tatgacctta caacaaaaaa tactatcaca 

29751 gaagtttttg ggatatctaa aacacatgca 

29821 aattggaaag ttggggtatc tggcgtgttg 

29891 agagatatta gaagaacaat tcgagttatt 

29961 gaagaacgca tcaagttaat gattcgttta 

30031 agaaggtatt tttgaagaat taaaactatt 

30101 gtagattcat caattgtaca agagaaagtt 

30171 aatcagttga agaagttaag gaaacttctg 

30241 gttccttaaa aaagcagata cttctgataa 

30311 aagctatcta ctatcaaaga agagcattat 

303 81 gaagctagat cactcaaata gagctcatgc 

3 0451 ccaccgagta ttaaggcaag tgaaggtatt 

3 0521 ctcatgagtt aagtgagtta tatttcagtc 

30591 ttttcaaaat tataagcgaa atcaatatta 

30661 aatgtagaag aaaaatataa cgaagctttg 

30731 tggatttagg taaatacgtc cctgaatctt 

30801 tgaaattatt gaccttaaat acggtaaagg 

30871 tatggcttgg gcgcatatga actgcttagt 

30941 aaccacgaat agataacttt tctactgaag 



ttttaacata ggttccattg aatcaccatt aactaaaata 
tctttaaaaa atacttcttc atgcaatatg tcatcatata 
cacatgcaat atacgatact agtttagact ctttatatcc 
caattgttca tttgcatagt taagtacgtt ttcttggcgg 
atgtcgttat cgtctttgta tgtagtattt gattcactat 
cagccaaaat tttggcagtt gataatcgag gttcttcctt 
cgttaatttc attaagtcgg gatatttatt attaagatca 
tttttttctt agcttcttta aaccttcacc aatacccata 
tataaaagtt tcgaaaacga aacgcaagga aaatattatt 
atgtattctt aaatcaagtt gttacaaacg aaacaaaagg 
taaaccatac ttaaaaataa aaagcttgat tgcacttaaa 
atcggaatga gtagaagttt attgagtata aagataaatc 
aagctaaaaa attagcagat catttaaatg ttaaagttga 
tgacaactaa ataaaaataa ggaggacact atggaacaaa 
ttatagcgaa agaagttaga aatgctataa aaggcgagaa 
agtaagaatc aataatgacg atttagaaga aatcaataaa 
ggaagattga ggaagctcaa tcatccgatt ccgctaaaaa 
aaaaagctta tgtacaagat gttcatgacc atattagaaa 
taattcagac ttgagtgaaa gtgaatacaa cctagcagca 
ttatatatct atgaaaagag agtttcagaa ttaactatcg 
gaaactacta agaaggctat tcaataaaaa acacgaaaac 
ttaaaagtga aagaaagcaa attaaaaaaa tataaagtgg 
ttaaataagc gcacttaatt agtgcaagta atcaagtgcg 
cttttttctt cttcttgtaa tcccaataac acagaagagt 
ctttagcgaa tgcaattacg tcatcaccga cttcttgcca 
tctaggtaat agcgagattg taatatcgtg agcaattttc 
tgggagataa ctaaattata taacaaaaca acttaaagga 
agtcatctat tactactatg acgaagaagg taataggcga 
gaactgatgg tccgatctca tttcatcaac aacaccattg 
atgccttggt tgatggttat gaatttaagt tagattgaat 
cccataagat taagagacat actggatgtt ttgttaacga 
ctctaatatt atcgagaaat tcatggccag accaagtgat 
ttcgatgaat ttcagatcgc aacaaataaa tttagcttct 
tcaaaatcat atttatcaaa aataatatta tcgttgaaat 
tattagattc tatttctaag agcaagagtc taacgcaatc 
acaggagtat agcagaaagg atcataaaca tcttaaaagg 
ctaagatagc tacaaaaaat cttgtctcta tgacacggaa 
attaccaaca aatgatagtt ttttacaatg catcatttca 
tggcaacctt cagccgatga cctcatggca aatgattggg 
tattgaagca attttagaaa tgctatcaat gatacttttt 
acaacagtct tgtctgaaat tgttacatga taaatagtgt 
agtttttaag tttatttaaa tcgtatttta catcttcgaa 
atctttagca cttccaaaat tattgcaggt taatttaacc 
ttgtagagta cggacaagat atattgttgg tctttagtaa 
tgttatcacc tccttaggtt gataacaaca ttatacacga 
gaaggattgc gtataggcgt cccacaagtt tctagcaaag 
aggaacgtaa cttaggagcg gaaatattag agcttattaa 
caaagttttc tatgcattag atagagaact tcaatacagg 
aaaggagtga tagagatgcc aaaaatcata ataccaccaa 
aatttgtgaa aaagttatac gcaacaccta cacaaatcca 
atacaactgg ttgaaatatt accgtgaaga taatttaggt 
acgggaacat tgattaatat ttctaaatta gaagagtatt 
aggattatca aatgagcgac acatataaaa gctacctatt 
tgtactcatg ccgtttctat acttcactac agcatggtca 
atattttata aggaatactt ttatgaagaa taaagaaact 
atgagcaatt gtcttaaata attatataag gagttattaa 
ttttgcaaca tatgacaatt tcaattctga tgatgttgtt 
aaatccacac tttcaagact taagaaaaaa ggaaagattg 
ttgaaccgca gttacattta actgttgtag aacgtaagaa 
ggcaagatta aacgaacaaa gtgatgaccc tagagaaata 
gccaaccaat tttaaggagg agttaatcaa tggcaatatt 
aaataagaat ttacgtgtgc taaatactga actatcaact 
aaagaagcac caatgccaaa agatgaaaca gctcaactgg 
ctgatttaac taaagattat gttttatcag taggaaaaga 
gaaagaattt agaaataaac ttaacgaact tggtgcggat 
gaaaaaattg ttgattttat gaatgcgaga ataaatgcat 
aaagcttagt gcaagtggag caaaacaatg gctaaactgt 
gcagataaaa gttcagtttt tgctgaagaa ggtacattcg 
ttaaatatga aggcctaaca cagtttgagt ttaataaagc 
cagtgaagag ttgcgcgaat atgttgaaga gtacgtagct 
agtagagatg acgatgtaat agctttattt gaaacaaaat 
ttggtactgg tgatgtcatt atattttcag gtggtgtact 
cattgaagtt tcagctatag ataatcctca acttagatta 
ttaatgtatg acattcatac agttcgcatg actatcatac 
agttaccaat atcaagatta cttcaatggg gaaccgattt 
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31011 tgttaaacca ttagccagac ttgcttataa 

31081 tgtaagataa agcattcatg tagaacacgt 

31151 tgttgagtga tgaagagatt gcagaacttt 

31221 agaaaaatat gcact agate aagegaaaga 

31291 cgctcgcgaa gaatgataac tgatacaaat 

31361 aagatattac agaaaccaag ttacttagca 

31431 taaaattgea gaaggcttta tagaaaagee 

31501 ccagctataa ageaatctge tgaagatgat 

31571 atgaaagcaa aagtattaaa taaaactaaa 

31641 ttgaacctca cagtatgeaa gaagggcaag 

31711 tacaagtacg ataaaageca ttgaacaagc 

31781 ggaggcaaag ttcctgcaaa tctgaaactt 

31851 attatcaaga cgcttatttt attaaegcat 

31921 tagattaacg gattctggaa ctattgtaag 

31991 aacacaaatg gtaataaggg tategcagtt 

32061 ttggcggtgc aagtgcagca gaagatgatt 

32131 caataggtgg ggtttttagc cccactttaa 

32201 atatagatat tgaaacatat agcagtaacg 

32271 agatttcgaa atcttaatta tagcttattc 

32341 gtagataatg agcctttcca cgctgattat 

32411 agtatgeatt caatgetaat ttcgaaagaa 

32481 agaatggatt tgcacaatgg ttaattcaat 

32551 gttttaagac tacaaaacca aaaagataaa 

32621 agecaacaaa agttaatgga ggaagaacaa 

32691 tatagattac tgtattcgag atgtagaagt 

32761 actgtaattg aacaagcata ttgggttttt 

32831 cattgatgtt aggagctaat gtgetcgata 

32901 aacaggttta gaaaatccta atagtcctac 

32 971 atacctaatt tacaaaagaa aacggttcag 

33041 tagaaattag attgeaaatg tctaaaacca 

33111 tgatgaaegg gtaagaggtc tgtttcaatt 

33181 caacttcaga atttaacaaa gcattatatt 

33251 aacaacgttt tgacgattta gatttattac 

33321 gacgacattt actgetgaag aaggtaatga 

33391 atagcatggt atgeaaaaga acaatggcgt 

33461 cggcttctca aatgtttaat gtaceggtag 

33531 agtgtccgaa ttagctttag gctatcaagg 

33601 ggcattgaag aaaacgagtt acaaggttta 

33671 tttggaaggc ttgecaagag getgeaatta 

33741 tagattttat atgaaaaaag gttttctaat 

33811 gctttagttg gtgaaaatag ttggggtagt 

33881 ggtcaaagtt aaaaaegtat ggtgggaagt 

33951 tgegatttet atagcaaggc ttgaagcatt 

34021 gtagaaatac ctagaggttc aaatggactt 

34091 caaaaggatt gaatttgaat agtgacgggt 

34161 catgeaacat caagcttata teaatgette 

34231 tacaatcaga ttgataaaga aaaagaaaat 

34301 aatataacgt tataaatatt aaggttttag 

34371 gtgegtatea aaggaaaact aatgacattg 

34441 ttagatatag atacaaacat aaaatgcgcg 

34511 agatgaagtt gaatatatgc agagtcaaat 

34581 gcgattttga acctatacca acgaaatgtg 

34651 catggcttta tgatggaacg ccacaaaaac 

34721 aatgttcaag aaatggagtg aagcataatg 

34791 aagacaatgt taagcaacca gcgcactaca 

34861 tacggcacag tatccacctc aactagcatt 

34931 ttaaagaatg gtcatgagga tttagcaaag 

35001 gatgaccatg acagatagcg catgtaaaga 

35071 caggataacg aacgagtggc acatatccat 

35141 caggctggca aggcgtgaaa aagacatttg 

35211 tttggaatac gaggaacaga agcaactaac 

35281 aaaaataatg aaaatagacg aattaattaa 

35351 tattatacaa cagacaaaaa tgatgaaaac 

35421 gcgattttat attagttaat gctactttta 

35491 tgataggttg tttgaagtgt acgagattca 

35561 ataaacgaac gtttaaaaaa tgacagaatt 

35631 etatgaegtt aatttggaaa gaaggagagt 

35701 aagcaaagtg aatactggtt tgatgaagca 

35771 tagctaagtt aagagagtgc aacaaagagc 

35841 cgttgaaaaa gatttaataa acgaatttgg 

35911 aataaaattt ttatggagga agaegcaaat 

35981 agegegtata aeggcaatga cacagaggga 

36051 cgtttgatga aatacttgag ggtttaccta 

36121 tgaagcagta ggaattatga egggtcaagt 

36191 attacaagtg aaactattat caaaagaege 



cggtgaaggt gagtttaaag caggtagtca ttgtagattc 
gcagaataca tgcaaaatgt gectcaaaag ccaccacatt 
tatataaact gcctgacatc aaaaaatggg ctgatgaagt 
aaatgataaa aactattctg gttggaagct tgtagaaggt 
geaaegcttg aaaagttagt tgaagcaggt tataaacctg 
ttacgaattt agaaaaatta ateggcaaaa aagcattttc 
acaaggtaaa ttaacacttg ctaccgagtc tgataaacga 
tttgacaaac tataaaaatt aaaaaggacg gtatataaac 
gtgattacag gaaaagtaag age at cat at gcacatattt 
aagcaaagta ttcaatcagt ttaatcattc ctaaatcaga 
tatagaagct gctaaagaag aaggaaaagt tagtaagttt 
ecattaegtg atggagatac tgaaagagaa gatgatgtga 
caagcaaaca agcacctggt attattgacc aaaacaaaat 
tggtgactat attagagctt caatcaattt atttccattc 
ggattgaaca acattcaact tgtagaaaaa ggcgaacctc 
tcgatgaatt agacactgat gatgaggatt tcttataagt 
ttttaaagaa attgaggtgt caagaatttg aaatttatga 
atatttcgaa atgtggtgtc tataaataca cagaagctga 
aatagatggt ggaccgatta gtgcgattga catgactaaa 
gagaegttta aaattgetet atttgaccct gctgtaaaaa 
cttgtcttgc taaacatttt aataaacaga tgccacctga 
gcgtattggc ttacctgett cgcttgataa agttggagaa 
gcaggtaaaa atttaattcg ttatttctct ataccttgta 
gaaatttgee tgaacatgat cttgaaaaat ggcaacaatt 
agaaatgaca attgetaata aaattaaaga ctttccagta 
gaccaacata taaacgacag aggtattaag ctttctaaat 
agcagagtaa agaagaattg cttaaacaag ctaaacatat 
acagttattg gcttggttaa aggatgaaca aggattagat 
gattacttaa aagtagcaac aggaaaagct aaaaaaatgc 
gtgtgaaaaa atacaacaaa atgeatgaca tgatgtgcag 
ctacggtgcc ggtactggaa gatgggcagg tagaggtgta 
tcagatactg aattagaaat agcaagagat cttattaaag 
tcaatgttca tcctcaagac ttattaagtc aattagttag 
actagcagta agtgattttt ctgeaataga ggcaagagtc 
ttagatgtgt tcaacacaca eggaaagata tatgaagcat 
aaagcataac taaaggegae cctctcagac aaaaaggaaa 
tggcgctgga gctttaaaag caatgggtgc attggaaatg 
gttgatagtt ggegtaaege aaatcctaac atagttaatt 
atactgtaaa atcccgaaag aegcatcata cacatggact 
gattgaactg cctagtggaa gagctttagc ttatccaaaa 
caagttgttg aatttatggg gttagatctt aacegtaaat 
tagtcgagaa tattgttcaa geaactgeaa gggatttact 
aggttttaaa atagttggcc atgtccatga tgaagtaatt 
aaggaaatcg aaactatcat gaataagect gttgattggg 
ttacttctcc gttttatatg aaggattagg agtgtgattg 
tgttgacatt agaattccta cagaagtcga aagtgttaat 
ttggcggact atttatttaa taatccaggt gaactattaa 
atttagaggt ggaatgatgg ctagaagaaa agttataaga 
agagaagttt cagaaaaata tcacatatct ccagaacttc 
gcgatgaatt attgtgtgga agaaaagact caaaatctaa 
aaaagatgaa gaaaaagaga gagaaaaaat cagaaaaaaa 
agageggaat atgaagaaga aagaaagaga agattgagac 
attcacgtga teegtactgg ttcgatgtca cttataacca 
agegtaatea gtaacagaaa agtagatatg aacgaagege 
cataeggega cattgaaatt atagatttta tcgaacaggt 
cgcaataggt aatgcaataa aatacttgtc tagagcacct 
gcgaagtttt aegtccaaag agcttttgac ttgtgggagt 
atacttaaac caatttttcg gatctaagag atatctgtat 
gtagtgaatg gcacttatta etttcaeggg catategtae 
atacagegga agagctcgaa acatatataa agcaacatgg 
tttattttaa ggagatagaa atgatgaaaa tcaaagttga 
gtgggcgcga gaaaatcegg agctatcatt tggcagaaaa 
tttatttact tcggtgtttt taaaaattgt tttaaaataa 
gtgtcaaagt tgaagaagaa gtaaccgaag aaactaagtt 
agaaggagtc tataaatctg catcatatga gaatgctagt 
tttcttgeta aagcattcta catcttaaac gacgacctaa 
tgattaaata atggaacacg gttcaaaaga atattacgaa 
agcaagtttt tgaagcaacg tgatgagctt attggagata 
tggagaagaa ageaagtgea tgggataggt attgeaagag 
caaagatggt gaaagagtta aatttggaat ggaattaaac 
gaataacege gaacaaatcg aacaatcagt tattagtget 
ttattaaaag agattgagga cgtgtataag aaagcgcaag 
atgetatgea agatgeaate aaagaagata ttggtcttga 
tgtctataaa tatgaggagg agcaggaaaa tgactaacat 
tagaatgeca gaacgaaatc ataagaegga tgcaggttat 
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36261 gacatatttt cagctaaaac tgtcgtactt 

36331 taagcattcc agagggctat gtcggtttat 

36401 gattgaaaca ggcaagatag acgcgggata 

36471 acgttagaga gtgaggatat gagtaacttt 

36541 tacctgtaac agataaattt ttatgtatga 

36611 attggttatc gtgcctatat ggacacctga 

36681 ggagcaaaag gcttcggaag tagcggagtg 

36751 tgagtgacat gttagaaata tttttcatag 

36821 tctcaagagt aaaaagacta tacacacaaa 

36891 tctacatttg ctgataaaca tcaaaagacg 

36961 agctcaaaca agttcaaggg agctatgagg 

37031 ggacgtaagc atacacacat aactaaagct 

37101 aagaagaagc gaaagagaag tacgaggcac 

37171 tgaaaatata agggagtgtg ggaaatgact 

37241 agctaggcaa gtatgtaact gagcaagtat 

37311 tgactttgaa aaaatcagag ctgaagtttc 

37381 tgtgtttagc aatagcactg gaggtgttgt 

37451 ttattgatct tgttaagtag catttctcat 

37521 atcttttcaa gcagttagta aaaaataatg 

37591 aattagaaaa agatttaaaa gctaaagaga 

37661 ccgaacatgt tagacaaagt cactcaaata 

37731 ctgctagtcg tttatctaca cattggacta 

37801 acaaacagtt agaactaaag aagatttaac 

37871 aaagatgttg gcggatttgt cggtggttat 

37941 gttcaatgtt aacacttgat atcgattatg 

38011 ttttgcatat tgtttatatt caacacataa 

38081 ttaaaacgaa atgtaaatgc agatgagtat 

38151 attacttcga tgatacaact tatcaaccac 

38221 atttttcttt acctatgaag atttaccttt 

38291 actgacacat tagaatggcc aacgtcttca 

38361 gcgacccaga agaaaagccg ggaattgttg 

38431 aacttttatt cctgatttat acgaaaaaca 

38501 ggtggattgg tgttatacga aaataacaag 

38571 tgcttgtgaa cagttttgat ttagtacgca 

38641 tactccggtt aatcgactac ctagttataa 

38711 aagcaattaa ttaacgacaa aatgtctgat 

38781 catggtctga gacgttagaa attacttcga 

38851 attgcgtaat gatccaaatt taaaaggaaa 

38921 gggaaaatgc catggaataa taattttaaa 

38991 gttatatcga aaagatttat gacatacacc 

39061 gcaaaatgcc tatcatccag taagagatta 

39131 aagttattta tcaaatactt aggtgttgaa 

39201 ctgctggaat cgctcgagta atggagccag 

39271 aggtgtaggt aaatctgctt tgctaaaaaa 

39341 actggtaagg aagcatatga ggcattacaa 

39411 gaaaagctga agttgaagct attaagcatt 

39481 acattatatt gaagattttc caaggcaatg 

39551 gatgaaactg gtggaagacg tttttggcca 

39621 aactaaccaa agaag agate gaccaaatct 

39691 gttccttaac cctgaactag aagaagaaat 

39761 acaggtatta ttgatgaata tcttaacacg 

39831 gaagacgatt ttatcaaggt gatgttgata 

39901 ggtctgtgcg cttgaagtgt ttgttgaatg 

39971 attagaaaga tttctaacgt cttaagacaa 

40041 aaattcgatt tggaaaagat tatggtgtac 

40111 ataagaaata ttgaataaat atacattttt 

40181 aacacggtgg tgtaaaaagt aatcgtaggt 

40251 gatacaacac ctctttccct tctcgctgta 

40321 ctataaagtt taaaaagtag tgttagggag 

40391 aagtttggca agaattgatg caacategga 

40461 gaatcgacat tagaaaaata tttagtgaaa 

40531 cacctggaac aagaggtgta ccagatagaa 

40601 gaagcaagaa aagggaaagt tacatccttt 

40671 acagtgtatg tgttatggaa taaagaacaa 

40741 attgatttca aaccacatag ctatcaaaag 

40811 tgtttttaga tatggggcta gggaaaacag 

40881 cactaaaaaa atgttagtca tagcacctaa 

40951 tggaaccatt taaatcatct gaaagtgtct 

41021 acacagaggc tgatatctat gtaaccaata 

41091 atggccattt gacatggttg taattgatga 

41161 tctattaaaa agaaattacc actcattaat 

41231 tacaggattt atgggctcaa gtttatttga 

41301 tcgagaaagg tactttaaac caacacatca 

41371 tctgaagaaa agatatatga acgaatagaa 

41441 tgcctgacag agttgatact aaacaaacag 



gagecacaag aaaaggcagt gatcaaaaca gatgtagctg 
taactagccg tagtggtgta agtagtaaaa cgcatttagt 
tcatggtaat ttagggatta atatcaagaa tgataatgaa 
ggtcggagtc cttctggtat agatggaaaa tacaccctac 
atggtagtta tgtcataaat aaaggegaca aactagctca 
actaaagcaa gtggaggaat tcgagagtgt ttcagaacgt 
taaagacata ttagatcgag tcaaggaggt tttggggaag 
ggtttggtgt ttatctattt tgtegcatag gtattatttt 
cctatatgaa atgttgttga ttgetactat ctttgtgaca 
catatcttaa tagcattttt agtaatgttt tttatgagta 
aatgacacaa tacctagtca caacatttaa agattcaaca 
aagagcaatc aaaggtttac agttgttgat gcggagagta 
aagttaaaag aaatgcagtt attaaattag ggcagttgtt 
aaacaaatac taagattatt attcttacta gcgatgtatg 
at at tat gat gaeggctaat gatgatgeag aggegecgag 
atggtaatag ctattatcat ttttgaatta attatattaa 
aaatatgtgg attgtcattt caattgtttt atctatattt 
aagatgaaaa ccatagaagc attggagtat atgaatgett 
gtgttgaagg tatagaagat tatgaaaatg aagttgaacg 
ggcgttggct tctctgttct atttaaaata atgaaaggag 
gaaacaatta aatatgatcg tgatgtttca tattcttatg 
atcacaatat ggcttggtct gaetttatge agaagctagc 
tgagtacaat aaaatgtcta agtctgaaca agecgatata 
ttaaaagaag gcaaacgacg tgctggtcaa gtcatgaatc 
ctgetcaaga tatgactgac atattatcta tgttttatga 
gcatagagag ataagtccaa gactgcgttt agtgattcct 
gaagctattg ggcgtaaagt egcagatate gttggcatgg 
ataggttaat gtattggcct tcaactagta aegatgegga 
gttagaccca gataaaatat taaatgaata tgttgattgg 
agggaagaga gtaagactaa aagattagca gataagcaag 
gtgcattttg tagagectat acgat agaag aagctataga 
ttctactaac cgttatacct atcatgaagg ttcaactgea 
tttgectatt ctcatcataa tacggatccc gtaagcggta 
tacacttata tggtgctcaa gatgaagacg ctaaaacaga 
ageaatgeag caaagagege aaaatgatga agttgttaaa 
geaatgeagg atttcgatga aatagtaaat agegatgatg 
aaggtacttt caaagctagt atcccaaata tagaaattat 
aatagcattt aatgaattta caaaacaaat tgaatgetta 
ataegtcaat ggcaagaegg tgatgatagc agtttaagaa 
attcaggcaa aacaaaagat gecattataa gcgtagcaat 
tctaaataaa atatcgtggg atggacataa aegtcttgaa 
gacactgaag tgaatagaac aactaccaaa aaggcattga 
gatgtaaatt tgactatatg cttacacttt atggtcctca 
aataggtggt gcatggtttt ctgacagttt agtttctgtt 
ggcgtttggt taatggaaat ggcagaactt gcagctacaa 
tcatatctaa acaagttgac cggtttcgtg ttgcttatgg 
tattttcatt ggtacaacta ataaagttga tttcttaaga 
atgactgtaa atccagagag agttgaagtg aactggtcta 
gggcagaagc taaatactat tatgaacaag gagaagagtt 
gcgttcaatc caaagtaaac atactgagga atctccatat 
ccaatcccaa gcaattggga agacttaact atctttgaaa 
tgttaccaac aggaaatgta gattacattg aaagagacaa 
ttttggtaaa gataagggag atagtagagg atctatggaa 
ttagacaatt ggtctgtata tgaaggcaat aaaagtggga 
agatagegta tgtaagagat gaaagtttag aggatttaat 
agatgttgta tcaaatgttg catcattttt tgagtgatgc 
gttgtatcat ttttggtgat gcaacattga tgcaacaaat 
aggttcaacc ctgtttgttt ccaatgttgc atcaaattca 
taaaggggta taggggtaac cctctaacag ctatttttaa 
acacaaatat aaattttgta tacaaggtga ataaatgaaa 
gagataacaa agttaaatgg attatgttta aaatgggtcg 
ttattattat gecagaagga aaaacatatt ttgtagaaat 
acaaaaatat gtgeategge aatttgaaaa cagagatcat 
gtaaatactt ttataagaat ggtaggtgga acatttggcg 
tatgeaatag ataaagtgat tgataatgag aaatacggtt 
tatcaacact tacagcattt agtgaattgc agttgttaga 
acaagttget aaagatacat gggttgatga agttgataag 
ttagtcttag gaacacctaa agaaagaaat gatgeattaa 
aagaaaatac taaatggtta tgtgatcaat ataaaaaaga 
actgtctaca tttaaaagtc ctaagagtca aaggtttaaa 
agatttatag gattaacagg aacacctagt ccaaatagtt 
tagacagagg cgaaagactt gagtcttcat teagtegtta 
agttagcgaa catgttttta actgggagct aagagacgga 
gatatatgtt taagcatgaa agegaaagat tatctggata 
tagtcttatc tgaaaaagaa agaaaagtat atgaagaatt 
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41511 agaaaaaaac tatattttag aatcggaaga 

41581 caaaaactac ttcaactatc taacggtgca 

41651 agaagttaga taagttagag gaaattatag 

41721 caaacatgat aaagaaagaa tacttcaaag 

41791 gaacgttgga atagtggaga cattaagctg 

41861 tacaacaagg tgggcacatt attgtttggt 

41931 tgcaagatta tatagacaag gacaaaatca 

42001 gatcaaagag tatataaagc tttacaaaat 

42071 caagaatagc taagcataag taatggaggt 

42141 atttaaatat attgaatcag aaatatataa 

42211 gagatactta acccaacgaa agaactagac 

42281 ttagaacaac tgagttaatg gcgacaaggt 

42351 tgaagcagtt gaaagtgagt acttaaagtt 

42421 aataaagata agaagctaaa gatagaacaa 

42491 caatacgaaa gaactttgtt aaagcgatag 

42561 gcaaaaggcc tacaaatctg tagtaatatg 

42631 cgacataaat acatgaggca catcgctaag 

42701 tgaccaagca taataacatt tataagcatg 

42771 agcatggaag aagttaagag agatagcatt 

42841 gatattataa cagatgcaaa gattgtgcat 

42911 acttagataa tctaatgtca gtttgttata 

42981 taatcttaag aaaattagag ttctaaaaat 

43 051 tgcccatcgg cttaaaatgt tttttcgccg 



agaaggaaca gttgtagctc agaatggggc atcattaagt 
gtttatacag atgatgaaga tgtaagactt atacatgata 
aggagtctca aggccaacca atattattgt tttataactt 
gtttaaggaa gcaaccacat tagaggattc aaactataaa 
cttatagcac atccagcaag tgcagggcat ggattaaact 
ttggacttac atggtcattg gaattatacc aacaagcaaa 
tacgactatt attcatcaca tcatgaccga taacacaata 
aaagaactaa cgcaagaaga attgatgaaa gctattaaag 
ataagatggg aaaggcgtca tatgatatta agccaggaac 
tttaaatgag aacaagaaag agataaatag attgagaatg 
accaacattg tgtatggacc gttacaaaaa ggagagccag 
tattgactaa taagatgtta cgtaacttag aagagatggt 
acctgaagat cataagaaag taataaggtt aaagtattgg 
ataggggatg cttgtcacat gcatcgcaat acagttacta 
cgtatcatgc aggtatcaaa taacattgtg caaagattgt 
atagtatcgg aaagatgtat aaagttatct gaaagttata 
cggtgtgtct tttgttatgc aatcaaagag gtgtaagaga 
gtcgtaagtc atatcaatac gattggttct atcattcaaa 
agatagagat aattatcttt gtcaaatgtg tttacgcgaa 
cacattattt atgttgatga agattttaac aaagctttag 
gctgtcataa caaaattcat gcaaatgata atgacaaaag 
ttaaataaaa aaattattta aataaaattt tatgcccccc 
ggtaccggag aggcc 
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Table 8 



Bacteriophage 3A ORFs list 



SID 


LAN 


FRA 


POS 


a. a. 


RBS sequence 


STA 


STO 


100379 


3AORF001 


1 


8515 . .13488 


1657 


acaggtacggatttaagaaaacttt 


ttg 


taa 


100380 


3AORF002 


2 


37667 . .40114 


815 


1 1 1 aaaat aatgaaaggagccgaac 


atg 


taa 


100381 


3AORF003 


1 


32188 . .34149 


653 


ttaaagaaattgaggtgtcaagaat 


ttg 


tag 


100382 


3AORF004 


3 


17457. .19370 


63 7 


gctattttattagaaaggaaggtgc 


att 


taa 


100383 


1AORF005 


1 


334 . . 2034 


566 


aoaaaaaaaat sat t caaaaaaaaa 


ctta 

y »-y 


taa 


100384 


3 AORF0 OS 


1 


15571 . .17154 


527 


cttttatttataggtaggtgattta 


atcr 


taa 


100385 


3AORF007 


2 


19337 . . 20836 


499 


atgatagtaaaacaagttcagggcc 


atg 


taa 


1 00386 

J. V V J v \J 


1AORF008 


3 


22176 . . 23630 


484 


aatgattt agggtaggtgttgacca 


atcr 


tcra 


100387 


1AORF009 


1 


40726 . . 42093 


455 


gtaaatacttttataagaatggtag 


QtQ 


taa 


1 00188 


1AORF010 


3 


13491 . . 14738 


415 


gaggcggactaacgctacagtaaaa 


att 


taa 


100389 




2 


2039 3277 


412 




ata 

y *-y 


taa 


100390 


1AGPF012 


2 


4001 5209 


402 


aaacLaagagacLaaaatt saacgcga 


atg 


taa 


1003 91 


1AOPV01 1 


1 


10179 "JiqAt; 


388 


at t* t" +* a 1~ cia 3 1" Cir 1 criers ^ t - aaat"C3f 


atg 


taa 


1 flA7 99 


T&fiPPOl 4 


2 


1 471R 15569 


274 


a t* 1~ a t - 3 t~nofT3 <~i<~i1" i~ irrs ft" ?5 3 1~ f" 3 




tag 


i nAiQi 

1 vu J 


lanpfroi 5 


•J 


4014 


261 


\* l. L<MCLCIL. CCLCL^jCLdCl.CtV'Vr L. L. L-^j CX.Q. Cl^J 


— ~ 

gtg 


tag 


100394 




_ 2 


95587 9697"* 


228 


day cLd^ji* L- cicxy ctctctCL dctd i_ etc^ciaci w 


atg 


tga 


1 001 95 


lanppni i 




6799 7170 


213 


f~t*aa't~i~t~'t~'t" aaftnannaaat'aaar'a 


atg 


taa 






■J 


94 540 951 54 


204 


dctucicvcictL dcicictciy u ciy y uy aucicty 


atg 


taa 




-Sjf-iLATt.T J. _? 


2 


•31 Cfit: lOI OQ 
J13D3 . . J^l^o 


187 


ahaaaaat"l"aaaa sir* it a a 
C LaLdaaadL L, ctclctciciy y ctoyy uci U 


ata 


taa 


i ooi 98 








187 


nf'anh Jicr^rs a t" 1" 3 1 - napnerrfh paarr 
y wciy l. ciyyciciL. i« ouy oijyyy 1— ^ cicty 


ttg 


taa 


100399 


*s aopfo91 


2 




174 


y l^CLCtCCLCtCLCL L. UCtCLCLy CLCLCiyy CICL 


atg 


tga 


J. U v *a u u 


1 AOPF099 


_2 


1242^ 12918 


171 




att 


tga 


100401 


lAOP'F'n91 


u_ 


74(^9 7Q1 7 


151 


otdctctL, del a. u L-dctctyy dy aauaaL u u 


atg 


taa 


1 00409 


■5 anpp"n94 

onUivr u Zi ti 


1 


9^7*^1 971 74 


147 




atg 




i /">A40"3 


i anpwfi9 5 




4910^ 49547 


145 


t~ a a ftri af aa it+* aaf nrf^ /-rirt" a +• a a /*r 
U ady LdL day L- dd L.y y dy y L-dL-ddy 


atg 


taa 


i 00404 


-3 BOPF096 


2 


£i iJ Zi • t J ^> <J J 


138 


a ?i n r* a a r 1 1 - 3 a r* t* 1" t* a t* 1~ t - 1~ 3 a crci 3 o 


ata 


taa 


n nft405 




2 


DODO. < 0^70 


136 


a +* a t* +- /-Ti r, r/*' f* a+*aa , t~ ana ni- f~ 1" t~ 

dt citcyycu auaaLaLdy uyy U L- L. L, 


ate 


taa 


i 004 06 


■5 AOP'P'09 R 


-3 


97R45 98955 


136 


rr* t* t - f" t" a a cr a 1" cr i - 1* t - a i~ Cf a 1" p r 1 1* 1* 




taa 


100407 


1 anpT?n9 q 


■5 


74T44 1474ft 


134 


t*t* a aorrt" 1" t" 1~ acta t" 1~ t" acracrcifcrcra 
L. u ctdy y lil l nyaL u dy ciy y Lyya 


atg 


taa 


1 004 08 

J. U V 1 u o 


•C* x^v-^xv u j u 


2 


£299 fi£94 

S3 A ^ ^ * • OOJ* 


131 


t" a'haaaaaaacfaflt* i~cicf c nacrat" a a 
LwLuduuciciyy cxy l. ^yyv^ayataet 


atg 


L.dy 


100409 


1AOPF011 


1 


90833 21225 


130 


1" t - a a p a a a a t* t" 3 1 - acrcra ot* aacras a 


ata 


taa 


100410 


1AORF019 


-2 


-J J7 .> U *x • i *I W J O 4. 


125 


a a a 1" a cr p t* cr t - 1* a era crcrcrt - 1* a p c o c t" 


ata 


tag 


100411 


1AORF011 




7957 8195 


122 


era a i - a 1" pf* cr peri* p t* t" t* 1" f - 1* a t* t - 1* era 


ata 


taa 


1 00419 


1AOPF014 


-2 


9850fi 28871 


121 


cr f - f" a t* pa a p pi - a acrcracr ert* aataa p 


atg 


taa 

L.dy 


100411 


■a AC/RF015 


_2 




121 


i~ ppl'acrpt - 1" ppt" aaraararcaroa 


ata 


tga 


100414 




2 


10020 10189 


120 


a p p a a 1~ t* i~ i~ a a ci cr a cr cr a cr t* 1" a a t* p a. 

ciuv dd l# v. i». cLdy y dy y dy *~ clci ^ d 


atg 


tga 


100415 


1AORF017 


2 


21818 . , 22165 


115 


aagtgtaagt aat agttaagagt ca 


y ty 


tag 


100416 


•3 AOPF01 R 


-2 


49001 42147 


114 


cri - a pt* p a pt" t"l~ pa a pi" ar t"t" c a ar p 


ate 


tga 


100417 


1AC1PF01 9 


2 


91 IRfi 91 797 


113 


hrranaaaaf" p^acracri - pa t" a crert* t - 


ata 


taa 


100418 


ianpFn40 


-3 


99654 99995 


113 


r*foal"faar , hrr i t"r i r , t"t*sasat"hacf 
LtyaL uctcto LLCUOtL Laaaat l. y y 




taa 


100419 


■s aopfo41 


-1 


4111 4671 


112 


, l"af i 1'aaa1"r | 1"ar'a1"^t"fTai"r , rat"f , ra 
(.ctL taaa L ^ LaL aiu LyatvLaLy d 


att 


tga 




"3 a du a 9 

OHUKr U f± .5 






iin 

-LIU 


a a a 2S a a r^r \~ r*^rr~t a *^r/*rt~ rra +^ ^- "t - 4- +~ a a 
tddddady Lyy lay y LgdL t LtLaa 


atg 


Cyd 


100421 


l AOPF04.1 




9 5690 9601 9 


109 


t - appaaat"1 - aa'r*a1~ arrt* fh t" prrpai - 
LaLvaaaL 1— dd l, di^dy L-t* L. L. v^y L. 


ata 


tag 


1 00499 




*5 


99676 10005 


109 


rrt - pt"t- aaat"aat"t"fli"a1* a a rTCrarrt* f - 
y LLU LactaLaclL UdL.dL.da.yydy t, L- 


att 




100423 


1AORF04 5 


3 


30 . . 353 


107 


cget ageaacgeggataaat 1 1 1 1 c 


atg 


taa 


100424 


3AORF046 


3 


27894. .28214 


106 


aagatattgaaaagctaatttcccc 


ata 


tga 


100425 


3AORF047 


-2 


11907. ,12227 


106 


ttcgccgccaaaatgattagcattt 


ctg 


tga 


100426 


3AORF048 


-3 


40343. .40663 


106 


ccataacacatacactgtatgatct 


ctg 


taa 


100427 


3AORF049 


-3 


6749. .7069 


106 


tgttaaaccatcttcagattctcca 


ata 


taa 


100428 


3AORF050 


1 


42700. ,43014 


104 


ttatgcaatcaaagaggtgtaagag 


atg 


taa 


100429 


3AORF051 


-2 


13077. .13388 


103 


ttgtacgtaatcccacacatcgccg 


att 


tga 


100430 


3AORF052 


-3 


3722. .4024 


100 


gcatttcatttcctcctaataactc 


att 


tga 


100431 


3AORF053 


3 


17145. ,17444 


99 


tcgagacaatggatatagggagtgt 


att 


tag 


100432 


3AORF054 


-1 


19915. .20211 


98 


ataatttatagcttgegaaacataa 


ata 


tga 


100433 


3AORF055 


-1 


42436. .42729 


97 


aatcgtattgatatgacttacgacc 


atg 


tag 


100434 


3AORF056 


3 


40455. .40745 


96 


taaattttgtatacaaggtgaataa 


atg 


tga 


100435 


3AORF057 


-1 


38665. .38952 


95 


atcatcaccgtcttgccattgacgt 


att 


taa 


100436 


3AORF058 


-1 


21265. .21549 


94 


gaaatttctatctaacttgtcataa 


att 


tga 


100437 


3AORF059 


-2 


10278. .10562 


94 


tttagccgcgcttccaactgcacgt 


att 


tag 


100438 


3AORF060 


1 


5278. .5556 


92 


atatcagccgaataggggtgatgaa 


atg 


tag 


100439 


3AORF061 


1 


35668. ,35946 


92 


tttggaaagaaggagagttgattaa 


ata 


taa 
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xu u 




9 

£t 


J3JXZ , i JglO ( 


y x 




ata 


taa 


XUU44X 


TfinPCAfil 
JriL/Kir UO J 


■3 
J 


/ ZU • • JO?73 


Q1 




ctL L 


4-rra 

tya 


1 A H/l A O 

XUU442 


3AUTCC Uo4 


_ A 


J0094 , . jsy oy 


y x 


CCy 1 1 at aCy cy C t aycaC t aat aa 


ctg 


t- a a 
Ldct 


XUU44 3 


JAUKr uob 


_ A 


^zo9 / . . jzy / z 


y x 


aaCCgLttLCttttgLaQftLtayyt 


aid 


t-aa 

lad 


1 f\(\AAA 

XUU444 


0 a f\o i?r\ £ c 
3AUKr uod 




zyxb / * . zy4zy 


Q A 

y u 


CoaaCtCtaaCatLCaLCLaaayga 


gtg 


4- a >- 

tag 


XUU44b 


jAUKU U 0 / 


-2 


OCCCI A £ Q *3 A 

zooox . . zby 


89 


at ac 1 1 1 1 1 tagcgg* a.at C99*^t9*3' 


ttg 


taa 


i n A ^ A c 
lUU44o 


3AUKr Uoo 


-2 


QO/l 000*2 

y&z4 . . yo9j 


oy 


+- -f- 4— ^ -f- ATi^ +- A +- A A A -^1 +~ A" "t - "I 4— 4— AT ^ 

t ttt Saty CaLCT. CCCaugt at tya 


at a 


tga 


XUU44 / 


JfVJK.r uos 




XJ04 / . .X4XXU 


O / 


tcjca.fcfc'fccctccfccja.tfc cgtgttg^s* 


ate 


tga 


1 A AA A S 


oriVJrtE U / U 


1 
i. 


3AQQT "3 COCfl 
JJJ . . JOZ3U 


03 


+-+-+- a a a-+- AAaaaAraA*Af-'t - +-+ - A"iA't~t* 


9t9 


t-aa 
Ldd 




J>rtf K.T U / X 


2 


3i7/c 35fin , 3 


oc 


a a a t" Act - 1* a 3 a a* 3 a a +" Atrta a+- ai a anp 

dcLdLy l LLciciycicicLLyy cty Lyctctyu 


at a 


f-rra 
tya 


100450 


*2AOPF079 


- 1 


?7"?79 ?7filfi 

Z/O/J. >« / OJO 


85 


L L Ly L L.y L LUU LLLL L LClCty L Ly L L 


ttg 


taa 


100451 


*3,anPF'fl7'3. 


2 


J / JO J • • J /OiJ 


82 


Lyy LcLa.Ld.yL Let L Ld L Ld. L L L L Ly CL 


att 


taa 


100452 




_ 2 




82 


L-y LLLyLLLLLL L ddddL L L da LClL 


att 


taa 


100453 


jriWivr U / 0 


_ "2 
-J 


9471 971 Q 


82 


dy LdL Ly L L LydddLLLL L L dauuL 


ttg 


tga 


100454 


■irt.L'rCE U / 0 


2^ 


9fiflA7 9S9Q9 


81 


aarrt* a Anf a*- +- nnrTirrnnannt - 




tag 


i a a a gc 

X UU4bb 




0 

£, 


7Q7Q7 1QC27 
Zozyz . . Z ODJ / 


ox 


^ |- a f" ^a-^a AiAf ^ rt'ra a +* a a a 
aaCat Cl L daaayy ayy aa l aaCaa 


a t- Ar 

atg 


t- arr 

tag 


1 A Ayl C£ 

XUU456 




-X 




/ y 


tttty*tstaag"y"cttagatttdCftc 


at t 


t-aa 

taa 


100457 


3 AURr 0/9 


-2 


b4b0 . . boyy 


f y 


dfctcagtccfcfcfc tfcaaasit ttctct 


ate 


taa 


XUU4bO 


JAUKr Uo U 


A 

-A 




/ 0 


cccycaatcactctayucLtaLLca 


at a 


taa 


i a a>i c q 
XUU4by 


3 AURr uol 


- 6 


(3*3 CO ft/i Q Q 

oZbz . . 04 00 


/ O 


aacffc t ttct taaatcccft accfccjfca 


atg 


t- Ata 

tga 


X U U4 o u 


1 71 f"M2 C A Q O 


- X 




77 


at at C L aC ay aCadCC CyaCCCytC 


at- a 

at a 


t-aa 
tda 


XUU40X 


3AUK.r uoo 




"^AfllQ ^A979 


77 


a a A-f- 1- a a p/^t-rrrrat" l~a+*'l - aaa , (~aa 
dLdy LLCaCCLyyaLLdLLadcttaa 


at a 


tga 


J. U vfatOZ 


jJr\\JK.C Uo4 


-1 


XZUU/. tXZZfcU 


77 


aAa'H+"f*+"+*t - f"i^a+-+*4- Ayr a a A; A A 3 a a 
dUdL L L L L L L CdL L L Ly LLy LLdcicl 


dty 


taa 


XUU4&J 


jAUKC U O D 


_ A 


jZjO / . . J Z D J? f 


76 


a "f- 4- 0 a ^ a at Art** a aAf arra a a ^ a a a aj a 
CttaCaayyCaCayayaoaLaaCya 


dtt 


t- a a 
tad 


i aaac a 




_ A 


"2ft<;i ft "JOftAft 


76 


afafaaf nf a -a Art* +- A»a ArAra +- *- a +- A+- 
atttLaaLCtaagttyayyaCtatCC 


at- a 


1- a a 
tda 


i aaacc 


O. 7a rtP P A Q *7 
jAUXJJ U O / 


_ "2 
J 


n/iQ73 
Z4 /4o , . Z4y / J 


"7 c 


atag9"^tttaay*ttcaccctcttca 


atg 


t- Ata 

tya 


i a a^i 


1 TA AO C A Q D 

3 AURr Uoo 


-3 


Xz9oU , .XJzU4 


/4 


tctttcfcfctttccftaccaccatcf^s. 


at- 4- 

att 


t- a a: 

tag 


100467 


3 AURr 0 0 9 


A 


<1QA /1Cr\Q 

4z90 . . 4bUo 


oo 
/ 2 


acacfS^-S^-^S*- & t at caat c 1 1 1 aa 


atg 


taa 


i n ft a ft 

100468 


3AORF090 


3 


A A A A ^ AAlJl*5 

28926 . .29141 


71 


ttatacacgaaaggagcataaacaa 


atg 


taa 


100469 


3AORF091 


-2 


13587 . . 13802 


71 


cttgtcttgctaattgcttagataa 


atg 


tag 


f n A a 1 A 

100470 


-% 7. /-\ri rift nil 


2 


26471 . .26 683 


70 


aaacgaaacaaaaggagggggttca 


atg 


taa 


100471 


3AORF093 


-1 


2524 . .2736 


70 


tccaccgttttcttcatagtactgt 


ttg 


tga 


XGQ472 


*> 71 f\Tt ft ft A 


~3 


25334 . .25546 


70 


tggcgctttaatataaaagacgtct 


att 


tga 


100473 


3AORF095 


3 


8316 . . obzb 


69 


aagagatgggaaagacagaagaaca 


ate 


tag 


i An ^ ijf 

100474 


*i 71 /AT5 W A A 


2 


A^"AAA 1 TI AO 

36992 . - 37198 


68 


aacaagttcaagggagctatgagga 


atg 


tga 


100475 


iAORrO 9 7 


-1 


T5CQ3 *3 O H Q O 

32593 . , 32 /yy 


£ Q 
OO 


aaagcttaatacctctgtcgtttat 


atg 


taa 


1 A A>l nc. 
XUU4 /b 


JAUKl U 5 5 


-1 


1 C3*C 1CCEO 

Xb34b . .xobbz 


b 0 


aatccatt aaatcacct acctat aa 


at a 


tag 


1 A <\A *7*7 

XUU4 / / 


J AUKr U 9 9 


X 


TOT C "7^ 1 Q 

/22b . . /42o 


C "7 


actggtgactggatgaacagaaaag 


*- +• AT 

ttg 


tag 


XUU4 / o 




-2 


OOCTrt OOQO"3 

z2o2U . .zzozj 


£ "7 

6 / 


cgacttcatgaccggcatgtcttaa 
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41 


acctaattcagataagcgataattt 


ata 


tga 


100613 


3AORF235 


1 


25558 . .25680 


40 


aacactgctgaaatagacgtctttt 


ata 


tag 


100614 


3AORF236 


1 


34420 , . 34542 


40 


acattgagagaagtttcagaaaaat 


ate 


taa 


100615 


3AORF237 


3 


38442 . .38564 


40 


gaagaagctatagaaacttttattc 


ctg 


taa 


100616 


3AORF238 


-1 


33628. .33750 


40 


caatcattagaaaaccttttttcat 


ata 


taa 


100617 


3AORF239 


-1 


29248 . .29370 


40 


t c 1 1 c t aat 1 1 agaaa t at t aat ca 


atg 


tag 


100618 


3AORF24 0 


-2 


18156 , .18278 


40 


gt ct ct caa 1 1 c t gt at agaat 1 1 1 


att 


taa 


100619 


3AORF241 


-2 


8088 . .8210 


40 


tttcaaggcttttgtataagtttta 


gtg 


tga 


100620 


3AORF242 


-3 


39149. ,39271 


40 


ttagcaaagcagatttacctacacc 


ttg 


taa 


100621 


3AORF243 


-3 


23558. .23680 


40 


aaaattaactgtttattaattttaa 


ata 


taa 


100622 


3AORF244 


-3 


1697. .1819 


40 


catttcattaaaggattattattaa 


ata 


tga 


100623 


3AORF245 


1 


19015. .19134 


39 


agttatgcaaggaatatgatgactt 


ttg 


tag 


100624 


3AORF246 


1 


22504. .22623 


39 


get aat c t aaacact 1 1 cacat cgt 


ttg 


taa 


100625 


3AORF247 


-1 


40567. .40686 


39 


aaagtatttacttgttctttattcc 


ata 


taa 


100626 


3AORF248 


-1 


23956 . .24075 


39 


tttagattcatgaaacgaagtagca 


ata 


taa 


100627 


3AORF249 


-1 


11113, .11232 


39 


cacctttccccaacacttttacagt 


ate 


tga 


100628 


3AORF250 


-1 


8719. .8838 


39 


1 1 tt at t agct t ct act age 1 1 1 aa 


ata 


taa 


100629 


3AORF251 


-2 


16899. . 17018 


39 


aactcgtctgttaagcgcttgttga 


att 


tga 


100630 


3AORF252 


-3 


37025. .37144 


39 


acaactgccctaatttaataactgc 


att 


tga 


100631 


3AORF253 


-3 


29138. .29257 


39 


tctacatactccaaacaattgatgg 


att 


taa 


100632 


3AORF254 


-3 


15476. .15595 


39 


caaatcaattcattaaaatccatta 


ctg 


taa 


100633 


3AORF255 


1 


13552. .13668 


38 


tt aat agacaaagt aaaat cgt ggt 


ttg 


tag 


100634 


3AORF256 


2 


12545. .12661 


38 


aaaagtgcaaagggctggctaacgg 


ata 


taa 


100635 


3AORF257 


2 


41870. .41986 


38 


gggcatggattaaacttacaacaag 


gtg 


tga 


100636 


3AORF258 


3 


10827. .10943 


38 


teaaacttttgaaaaaeggtttagg 


att 


taa 


100637 


3AORF259 


-1 


34570. .34686 


38 


gtgacatcgaaccagtacggatcac 


gtg 


tga 


100638 


3AORF260 


-1 


32389 . .32505 


38 


aag cagg t aagee aat acg c at t ga 


att 


tag 


100639 


3AORF261 


-1 


23830 , .23946 


38 


cctttttaacttttaataaaattaa 


ata 


tga 


100640 


3AORF262 


-1 


8158 . . 8274 


38 


ccatctcttctggttcagtttctga 


ate 


taa 


100641 


3AORF263 


-2 


14001 . .14117 


38 


ttatacctgcatttcctcctgattc 


gtg 


tga 


1Q0642 


3AORF264 


-2 


294 . .410 


38 


tttgcttgtttttattttcccttga 


gtg 


taa 


100643 


3AORF265 


-3 


42683 . .42799 


38 


tgacaaagataattatctctatcta 


atg 


tga 


100644 


3 AORF2 66 


-3 


31979 . .32095 


38 


aatcctcatcatcagtgtctaattc 


ate 


taa 


100645 


3AORF267 


-3 


26306 . , 26422 


38 


ttgtaacaacttgatttaagaatac 


ate 


tga 


100646 


3AORF268 


-3 


16490. .16606 


38 


tacatacaaggcttagcttttttat 


ttg 


tag 


100647 


3AORF2 69 


-3 


9872. .9988 


38 


tgagacccct ct aac cctgagtt ag 


ata 


tag 


100648 


3AORF270 


1 


21829, .21942 


37 


atagttaagagtcagtgcttcggca 


ctg 


tag 


100649 


3AORF271 


2 


29468, .29581 


37 


tgagcgacacatataaaagctacct 


att 


taa 


100650 


3AORF272 


3 


2955. .3068 


37 


gagttaaacagattttacttgcagc 


ata 


taa 


100651 


3AORF273 


3 


5010. .5123 


37 


tttggcaaaccagtagtatttacag 


atg 


taa 


100652 


3AORF274 


3 


19956. .20069 


37 


tcaagtatagatgaattaaagcaac 


ttg 


tga 
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100653 


3AORF275 


3 


39882 . .39995 


37 


gatatgttaccaacaggaaatgtag 


att 


taa 


100654 


3 A0RF2 7 6 


-1 


21211 . .27324 


37 


attaagtgegcttatttaattagat 


att 


tga 


100655 


3AORF277 


-1 


13516 . .13629 


37 


cgaccgtcattaaagttaagtccac 


ctg 


tga 


lUUoOD 


3AORF278 


-1 


11893 . .12006 


37 


ttttatatacacgaccactggataa 


ate 


taa 


100657 


3AORF279 


-2 


17535 . . 17648 


37 


tttgtaaagatttgtttactgctgc 


ttg 


taa 


100658 


3 A0RF2 8 0 


-2 


6474. .6587 


37 


tcaaaataagcatctaactgactag 


atg 


taa 


100659 


3AORF281 


-2 


759. .872 


37 


ttttgatatcgttgcgtcataatgg 


att 


tga 


100660 


3 A0RF2 8 2 


-3 


36608 . .36721 


37 


cccaaaacctccttgactcgatcta 


ata 


tga 


100661 


3 A0RF2 8 3 


-3 


14960 . .15073 


37 


tttcagttgaagaaccatcttttaa 


att 


taa 


100662 


3A0RF284 


1 


18859. .18969 


36 


atgttaacagagccaggtctttact 


att 


taa 


100663 


3AORF285 


2 


8237. . 8347 


36 


aaaacttatacaaaagccttgaaag 


ata 


taa 


100664 


3AORF286 


3 


5157. . 5267 


36 


tatgatcagcaacgtacattagaca 


gtg 


tag 


100665 


3A0RF287 


3 


38610 . .38720 


36 


tttgatttagtacgcatacacttat 


atg 


taa 


100666 


3 A0RF2 8 8 


-1 


36454 . .36564 


36 


tttatgacataactaccattcatac 


ata 


tga 


100667 


3 AORF2 8 9 


-1 


30217 . .30327 


36 


aacaattttttcataatgctcttct 


ttg 


taa 


100668 


3AORF290 


-1 


16678 . .16788 


36 


gcttttttgcaaattctaacagctt 


ate 


tga 


100669 


3AORF291 


-2 


14310 . . 14420 


36 


gtctagttaaagggataaccatctc 


ctg 


tga 


100670 


*5 7i rvn Tifv An 

3AORF292 


-2 


11457 , . 11567 


36 


ttctttcaattctttgattttctga 


ttg 


tga 


100671 


3AORF293 


-3 


29462 . .29572 


36 


1 1 c at aaaagt at t c c 1 1 at aaaat 


atg 


tag 


100672 


3AORF294 


-3 


22388 . . 22498 


36 


accattccaattttggccaaacgat 


gtg 


tag 


100673 


3AORF295 


-3 


18629 . .18739 


36 


aaaaggaacgcctcttgagtgaagt 


att 


tag 


100674 


3AORF296 


-3 


6332 . . 6442 


36 


tatcagacatgaagtctgaaggtaa 


ate 


taa 


100675 


3 A0RF2 9 7 


1 


13984 . .14091 


35 


aaatggttgaagtcacttaaaggta 


gtg 


tag 


100676 


3AORF298 


1 


40174 . .40281 


35 


tatcaaatgttgcatcattttttga 


gtg 


taa 


100677 


3AORF299 


2 


1481. .1588 


35 


gccgcgtgtgctacttttgcgttag 


ata 


taa 


100678 


3AORF300 


2 


40451 . .40558 


35 


aatataaattttgtatacaaggtga 


ata 


tag 


100679 


3AORF301 


3 


25479 . .25586 


35 


accactagttaaaacttcatatact 


ata 


taa 


100680 


3AORF302 


3 


32106 . .32213 


35 


gaagatgatttcgatgaattagaca 


ctg 


tga 


100681 


3AORF303 


3 


36024 . .36131 


35 


gacacagagggattattaaaagaga 


ttg 


tag 


100682 


3AORF304 


-1 


37762. .37869 


35 


accgacaaatccgccaacatctttt 


ata 


tga 


100683 


3AORF305 


-1 


24088. .24195 


35 


tttatctttaacaaaatcaaactga 


ata 


tga 


100684 


3AORF306 


-1 


19507. .19614 


35 


atcattaggtaattgaaattttaaa 


ata 


tga 


100685 


3AORF3 0 7 


-1 


16081 . .16188 


35 


atgtactgacagttgcagatacagt 


ate 


tag 


100686 


3AORF308 


-1 


11398. .11505 


35 


tttctttagttctagttaaaatgtt 


ttg 


taa 


100687 


3AORF309 


-2 


33003 . .33110 


35 


aaacagacctcttacccgttcatca 


ctg 


taa 


100688 


3AORF310 


-2 


24894. .25001 


35 


gtaaatcgaaatcgctaccagctga 


att 


taa 


100689 


3AORF311 


-2 


22005. .22112 


35 


ttcgtaggtgtcattacttctttaa 


ttg 


tag 


100690 


3AORF312 


-2 


21711. .21818 


35 


aaaataaaaagccagtgccgaagca 


ctg 


tag 


100691 


3AORF313 


-2 


17901. .18008 


35 


cattaggtcttagacgacttagcat 


ata 


taa 


100692 


3AORF314 


-2 


16710. . 16817 


35 


taattcagtcttaggagtatcattt 


att 


tag 


100693 


3AORF315 


-2 


15990 . .16097 


35 


acatatctccgtatcatttgggtaa 


att 


tag 


100694 


3AORF316 


-2 


2862. .2969 


35 


aattcttcttcatactgtttgacga 


ttg 


tag 


100695 


3AORF317 


-3 


40217 . .40324 


35 


tccctaacactactttttaaacttt 


ata 


tga 


100696 


3AORF318 


-3 


37535 . .37642 


35 


tgttcggctcctttcattattttaa 


ata 


taa 


100697 


3 A0RF3 1 9 


-3 


34421 . .34528 


35 


ttcttcatcttttatttgactctgc 


ata 


tga 


1 ft ft /■* ft ft 

100698 


3 A0RF3 2 0 


-3 


28262 . ,28369 


35 


catttgttggtaatatcttagttcg 


atg 


tga 


100699 


3AORF321 


1 


23989 . .24093 


34 


taaaaaggtttaatataaaaatgta 


ata 


tga 


100700 


3AORF322 


1 


34660 . .34764 


34 


aagagaagattgagaccatggcttt 


atg 


taa 


100701 


3AORF323 


3 


30105. .30209 


34 


ctaaatactgaactatcaactgtag 


att 


taa 


100702 


3AORF324 


3 


30258. .30362 


34 


ggaaaagagttccttaaaaaagcag 


ata 


tga 


100703 


3AORF325 


3 


40236. .40340 


34 


gttgtatcatttttggtgatgcaac 


att 


tag 


100704 


3AORF326 


-1 


36964. .37068 


34 


cgcatcaacaactgtaaacctttga 


ttg 


tga 


100705 


3AORF327 


-1 


35242. .35346 


34 


atttttgtctgttgtataatatttt 


ctg 


taa 


100706 


3A0RF328 


-1 


21916. .22020 


34 


ccatttaccttcttgagatgttgga 


ttg 


tga 


100707 


3AORF329 


-1 


18820 . .18924 


34 


ggtggcttaacttccaagaaccaac 


ctg 


taa 


100708 


3AORF330 


-1 


15631 . .15735 


34 


ttatgaagttttcacaaattagtaa 


ate 


tag 


100709 


3AORF331 


-2 


37998 . .38102 


34 


ttacgcccaatagcttcatactcat 


ctg 


tag 


100710 


3A0RF332 


-2 


7359. .7463 


34 


tttataaacctttaaagttttagtc 


ata 


taa 


100711 


3AORF333 


-3 


24584. .24688 


34 


aaaaat tat aaaact ataaaaccat 


ate 


taa 


100712 


3AORF334 


-3 


24269. .24373 


34 


tatttttaggtagataatttattaa 


ate 


tga 


100713 


3AORF335 


-3 


14273. .14377 


34 


cacttcagcaagttgatgctttgta 


ate 


tga 


100714 


3AORF336 


2 


7559, .7660 


33 


gtaactttatctaatttagaagegg 


ata 


tag 


100715 


3AORF337 


2 


13277. . 13378 


33 


aatataggtaaaaaagcaggagaat 


ttg 


tag 


100716 


3AORF338 


3 


9501. .9602 


33 


taggacgtacgatgacgatgggcgt 


ate 


taa 


100717 


j£\\JI\.r j »5 _7 






33 


atatctaattaaataagcgcactta 


att 


tga 


100718 


3AORF340 


-1 


37372 . .37473 


33 


ttctatggttttcatcttatgagaa 


atg 


taa 


100719 


3AORF341 


-1 


33421. .33522 


33 


aagctaattcggacacttttccttt 


ttg 


taa 


100720 


3A0RF342 


-1 


29047. .29148 


33 


tttggcatctctatcactcctttag 


ata 


taa 


100721 


3AORF343 


-1 


7549. .7650 


33 


atgatacgcctgagactagaattgg 


att 


taa 


100722 


3AORF344 


-1 


7297 . . 7398 


33 


ctgctgaaactgttgcagattttga 


att 


tga 


100723 


3AORF345 


-2 


23850. .23951 


33 


ttaaacctttttaacttttaataaa 


att 


taa 
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100724 


3AORF346 


-2 


20607 . .20708 


33 


aaaga t gt a c ga c t aga 1 1 1 agt t a. 


ate 


taa 


100725 


3AORF347 


-2 


14175 . . 14276 


33 


— . 1— /vJ* 4* >-«4— 4- —\ -~« ~i y ^ ^^l/T^I-f* "i +- ■"] ~1 

aLCtyttytLaaayaaCyCtaataa 


ctg 


taa 






_o 
- z 


0301 i p /UOj 


33 


cgtacactggttgacctgttaaacc 


ate 


tag 


100727 


3AORF349 


-2 


6882, .6983 


33 


tagaacgaccaataactgtatttag 


ate 


taa 


100728 


3AORF350 


-3 


40748. .40849 


33 


aactgcaattcactaaatgctgtaa 


gtg 


tga 


100729 


3AORF351 


-3 


38345. .38446 


33 


ggttagtagaatgtttttcgtataa 


ate 


taa 


100730 


3AORF352 


-3 


38081. .38182 


33 


tagttgaaggccaatacattaacct 


atg 


taa 


100731 


3AORF353 


-3 


35432. .35533 


33 


tagcattctcatatgatgcagattt 


at a 


taa 


100732 


3AORF354 


-3 


34952. .35053 


33 


ttatcctgatacagatatctcttag 


ate 


taa 
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Table 9 



Bacteriophage 96, complete genome sequence 

1 catagttata ggcttttcag ctatatacca agataagatt tatcccgccg tctccataaa aatatgcttg 

71 gaaaccttga tttaatgggg ttttaatcta gcaagtgtca aatatgtgtc aagaaaataa ttttctgaca 

141 cgttgacctt gctctttttt atgttcatca agtaagtgag agtaggtgtc taaagttata gatatattat 

211 aatggcctaa tcttttgcta atatattcaa taggtatacc tttagaaagt aggaaagatg tatgcgtgtg 

281 tcttaatgaa taaggtgtta ttgtagtatc atttagtcct atttgactct tagcatggtt aaatgacttt 

351 ttaacggcat tatgactcaa tttaaacaac ttattatctg tacgttttgg taattttgat aatttagctt 

421 taatatgttg tatatccttt tttggtacct ccacaagtct gtccgcgtta actgtttttg ttccacgaag 

491 atgtattgta ccctcttttt cgtttagatc gataggcaac atattaatta catcgctgta tcttgcacca 

561 gtgatagcta ggatgaataa aaaaatataa ctcgattcgt ctctagattt aaagtattct atcaattgca 

631 agtattgttc tatggtgatg aatttagagt gttcgtcttt tgattttttt gtaccacgaa tatctatttg 

701 atagctaggg tctttcttta aatagccctc atatactgca tctctgaagc attgtgataa acaactgttt 

771 aatttacgaa ccgtttcatt agtacgacct cgaccgaatt cgttcaaaaa cttttgatac tccgaacgtt 

841 tgatgttttt tattaaaaaa tcactcccga aatattcgtt aaataatttt aatgaacgtt gataccaata 

911 gaattgttgt gaagcgacat gtttcttatt ttttgaatct aaccaatcat tgtaatattc ttcaaacttt 

981 ttattttcat ctaaattgtt tccatcatcc aaatctctaa gcagttgttg agcagcgttg gttgcctcag 

1051 ctttagtttt gaatcctgac tttcttttct ttcctgattt gaaagacgga tgttttacgt cgtactgcca 

1121 agatgctgtt gctttattct tcctttttgt aattgtaaat gacgccattt tacttttcct cctcaaaatt 

1191 ggcaaaaaat aataagggta ggcgagctac ccgaaatttt attgttgaac aactattgct tcacttcttg 

1261 cttttcctac ttcttttcta aaactatcat atgattgatt agggtgtgtt aacgacattc ctggaccacc 

1331 tccagcatgt tggtttttgt ccggattatt ttccatttct tcagtggctc ttttagcatt taaatattct 

1401 tcgtaactag gttcgtttgg gtcgcgtggt tgtgcttgtt gtccattatt ggtagctgga agattcttct 

1471 gtacctgttg cttagatgtg ttattggttt gttgattgtt gttaatgttt gtgttgttct cgttgtttac 

1541 ttgattattg ttatcgtttt gattactatt ttcttttttc gcttctgctt tatctttagt ttctttcttt 

1611 ttgtctttgt tctctttctt tgtttcggtt ttcttgcttt cctctttctt atcgccgtcg ttgctaccgc 

1681 atgcacctaa cactaacgca ctagctaata ataaaactaa taatcttttc atgttttaca ctcctttatt 

1751 tgctatttgt tttaataaat ctatgatttc attgttttgt tctatgattt tgttttcatt tttaagatgt 

1821 tcgtctaaca tctctattaa gacgaaattt tgatttatca tttcgtaagt aaacatttga cctgtgttgt 

1891 taggattaga aaacgaacta ctgaaacgcg ttgaaaagct atctataaat tgaccaactt tattttttaa 

1961 taacatatct ttaccgctct cagacattgt atttagttcg cgcttattta aagttttttc tataattttg 

2031 tattttgttt cctgatttct ttcgatttct tctacttcaa aagggatatt gttattaaat ttttcgataa 

2101 tatcacgttt ttcagaaact gacatacgat caaatacttg tttttgacct ttatttaact tccctcgaat 

2171 ttttccggca gtccaagact ctttaactgt taacttatca ttaggaactt gattcatctt ttatatgact 

2241 ccttttctca tatttcttta tatttaaaaa ctctcaacgg ctcaaatgta atcgaatact cgccatagtg 

2311 agttccaata ccgtatatct tcttatattg ttctattgcc tccaatatgt attcttcgct taattgtaga 

2381 tactcagaca actcatacaa gttacgtacg ccataattgt aagcttctac aatttcgcgt aacgggactg 

2451 ctgagataaa gccgtgtcgt cttgcgtaat tttcgaactt gcgattgttg aatttcgatt gatctaaaat 

2521 gttgccatac gtcaacttgt ggtgggcaag ttcttcatat aatacttcta atttgttcct ttcggataag 

2591 gaaggtctaa taaaaatttc tccttcttga taccaaccat cgaatcctcg aggtactctt tgtgtttctt 

2661 tcacttcaac ttcacatttc ataagcaatt cttcgtattt tcccatgcgc caaacccctt tggtgtctta 

2731 tttctttcta tctctaaccc attgcataaa attttcgatt tcttcccatt cttcgggagt aaattcatct 

2801 ttatttgcat gaccggctat agtttcttga tgaatacttc tttcttctgt aattctcgat ttaggtacat 

2871 taaagtaatc tgctaattgt tggacttttg atattctagg atatttaagt tctttaagcc agttagagat 

2941 tgttgattga cttaccccga ttgcttcaga caattctact tgagtaatgt tgttctcttt cataagttgt 

3011 tctaagttct ctgataaaat ttttctagca ctcttatatt ccataatttt ctcctttagt attacttaat 

3081 gtaatactaa tttaccataa gtaatatcac ttttcaatac aaaatattac ttttttgaaa taaatatcac 

3151 tttaggtgtt gacatattac tttaagtgat agtatagttg taaatgtcaa cgggaggtga tacgaaatgc 

3221 cagaaaattt taaagagttc tctgtaaagg tctggagaac taattcgaat atgacacaac aagatgtcgc 

3291 tgataaatta ggcgttacta aacaatctgt aataagatgg gaaaaagatg acgcagaatt aaaaggctta 

3361 caattgtatg ctttagccaa attattcaac acagaagttg attatataaa ggctaaaaaa atttaacatt 

3431 aatatcactt taagtgataa aggaggaaac tgaaatgcaa gaattacaaa catttaattt tgaagaatta 

3501 ccagtaagga aaattgaagt ggaaggagaa cccttctttt taggtaagga tgttgctgaa attttagggt 

3571 atgcacgagc agataacgcc atacgcaatc atgttgatag tgaagatagg ctgatgcacc aaattagtgc 

3641 gtcaggtcaa aacagaaata tgatcatcat caacgaatct ggattataca gtttaatctt tgacgcttct 

3711 aaacaaagta aaaacgaaaa cattagagaa accgctagga aattcaaacg ctgggtaact tcggaagttt 

3781 taccgacgtt aagaaaaact ggtgcttacc aagtacctag tgacccaatg caagcattga gattaatgtt 

3851 tgaagctaca gaagaaacaa aacaagaaat taaaaacgtg aaagatgatg ttattgattt gaaagaaaat 

3921 caaaaactgg atgcgggaga ctacaatttc ttaactagaa caatcaatca aagagtagct catatacaaa 

3991 gactacatgc gataacaaac caaaaacaac gtagcgaatt attcagggat attaattcag aagtgaaaaa 

4061 gatgactggt gcgagttcaa gaacgaacgt aagacaaaaa catttcgacg atgtaattga aatgattgct 

4131 aattggttcc cgtcacaagc tactttatac agaatcaagc aaattgaaat gaaattttaa aacgaaatat 

4201 aggagaggct gaatatggaa tacatcggat atgcagacgc aaatgcgttt gtaaaaataa gtggcatttc 

4271 aaaagatgat ctagagaaaa aagtctactc gaacaaagag tttcaaaaag aatgcatgta cagatttggt 

4341 cgaggacaaa agcgttatat aaaaattgac aaagctattc aatttatcgg taccaattta atgattaatg 

4411 aatacgaatt ataggaggag ttatcaaatg agtaaaactt ataaaagcta cctagtagca gtactatgct 

4481 tcacagtctt agcgattgta cttatgccgt ttctatactt cactacagcg tggtcaattg caggattcgc 

4551 aagtatcgca acattcatat actacaaaga atacttttat gaagaataaa aaaactgcta cttgcgtcaa 

4621 caagtaacag tgacaaacat ttatcaaaat atacaactta attaaatcaa aatatacgga ggtagtcaac 

4691 tatggctgaa aatattaaaa ctgaacaaca ttattacact aaagatttct caggatacag aaatgaagaa 
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4761 gataactttg tagcaaatca agaattgaca 

4831 taaaggctgt taaagataaa gaagaagata 

4901 aaaattggaa aaagaaaata taaaactaaa 

4971 gaggacgaag aagacaagga ggacgagaac 

5041 agctttaacg ggtttgaatt taaagtgtct 

5111 atatgaataa tgttccactt aaatcgtttc 

5181 aatgcgtgac gttataaacg aatggattga 

5251 atgaaatggt aggaggtatg aaaagtgaat 

5321 accgattcaa agtaactgat ctagacagtg 

5391 agagaatgaa atcaacgatt tagcaaataa 

5461 gaaaaattac agagtggcaa agaatattta 

5531 aagatagcaa attcaagttg aatacacctt 

5601 agttagcaat gagcaagaag tcattaaaca 

5671 aaaaaactta gccaatcaga cattaagaaa 

5741 acggcgaagt tttagagggt gctagcattg 

5811 gatgactgaa aaaactaatc aagatgtcga 

5881 aatgcaaaca agttttataa atttgcgata 

5951 aagataacaa taccttagta ctagatataa 

6021 gcagattaag aattataagc attttagtgc 

6091 gaaaacggaa aacaaattga tgttgtagtg 

6161 acatcatgga cggtaaatca aagaaaccga 

6231 tatttatcgt tatatttcta aattacaaga 

6301 attaacaaag acaaagatga tgagggaagt 

6371 taaaaaaagc agtcatcagt caatctgacg 

6441 cgaaaaaact tatcaatatg tacttaacgc 

6511 agcaacatca aaattaacaa caaacgtttc 

6581 atggtaatta aaaattaatt aaaaggacgg 

6651 caagaaacta atcaagaggc attcatgaaa 

6721 ttgcaaatgt cgagtttaac gacagagaaa 

6791 acaatacaaa cacaaccaat tcgtcccacc 

6861 cttagtagat taggaattaa attgaactta 

6931 tcggaactat tgtacttaaa aataaattta 

7001 aaaagtttgg aataaagacg atgaagtagt 

7071 gaacagcaag caaatggtaa acagacacct 

7141 caatagaaat caatgatgat gatttaccgt 

7211 gaaagacaat gacggtactt attccgtcgt 

72 81 ctagaaaacg gatatccgct aaaagcagaa 

7351 gcaaaaaaat attcgcaatg tgtagagata 

7421 attattacaa acagaattgg aaattatgaa 

7491 gttgcgagag agttaataga actgattata 

7561 cgagtaagtt gttaagcgaa gataaagcgt 

7631 cggaaagcct cacgcagacc tggcacatta 

7701 cactacgaca aacatgtgtt agcactgtgt 

7771 cgtttgatga taaatatcaa ttgcatgact 

7841 aggagagaaa aatgaataag ttactaatag 

7911 aatagggtta aacgaagcaa tagtattgca 

7981 gatggcaaaa cttggatttt taattcttat 

8051 ctataaaaag gacatttggg agtttagaaa 

8121 atttgaccgt acaaaatggt attcaatcaa 

8191 caaaatggcc cgacgatgag gacaaattgg 

8261 actacacaga gactaacaaa catagagaga 

8331 tttagaaatt atacaaaacc ctttaaaagc 

84 01 cagttcgaaa tagtaaaagt cgctaccgat 

8471 ctgtattaaa gaactggaat aaagaaggcg 

8541 taactctaaa aaagaaacta ctgatgatgt 

8611 cgatgagcaa aacacaagca ttagaaatta 

8681 accaaagtta gaaatgtgga ttgatgtatt 

8751 gatggatata tcaacagtaa caacccgtac 

8821 aagtatctat tgagccggta gacaacgaaa 

8891 tgtcagacaa agaaaaatag cgctagataa 

8961 tgaattacgg tcaatttgaa attgaaagca 

9031 aaagataaga gttaaagatt acatgtttac 

9101 gtcggaaaga tagatcatca agaaatctat 

9171 ctataactaa actttacaac tccgatttca 

9241 ggaaagttat caaatcaaca aagcgaaaga 

9311 tttaataact tgattgatga actcaaggat 

9381 agtttgttga ggagtttgtc gatgagttat 

9451 gctcatggat tacaaaatag ggggattgga 

9521 ggtaagacag gttttgcatt aaacatgatg 

9591 gtctcgaaac aactggcaca tcagtattga 

9661 gataaaagaa atcaggaact taacgccgga 

9731 aaattaggca tcgatatttc tgataaaagt 

9801 attcagacag gcaacaagtt atttttatag 

9871 acgtgtagca gtagaaaaga tatcacgtga 

9941 ctactttcac aactgaatcg tggtgtcgag 



gtaacaatca cattgaacga gtacagaaaa cttattgaaa 
cttacagagg taagtatttt gcggaagaaa gaaaaaacga 
aaacaaaatt tatgaattac aaaacgaaga agataacgag 
gatgtattac aaaattggtg agataaaaaa caaaattata 
gtgatgaaga gacatgacgg tatcagtata caaatcaagg 
atgtcataga tttaagcgaa ctatatattg cgacggatgc 
aaataacaca gatgaacagg acaaactaat taacttagtc 
gatttacaag agagagaatt agaaacattc gaacaagacg 
ctaactgggt ttttaagaaa ctggatgcaa tcacaactaa 
agaaattgaa cgcataaacg aatggaaaga taaagaagta 
caaagccttg taattgaata ttacagaata caaaaagaac 
acggaaaagt gacagccaga aaaggttcaa aagtcattca 
acttgagcaa cgaggttttg acaactatgt aaaagtaact 
gatttcaatg taactgaaaa cggcacattg attgacgcaa 
tggagaaacc aacgtcatac acggtaaagg tgggagaata 
tattttaacg caactaggtg taaaagacat cagcaaacaa 
tacggcaagt tcggtactgg taaaactacg tttttaacaa 
atgaggacgg aacaacggta acagaagatg gggcagttgt 
agtgattaaa atgctgccta aaattattga acaactaaga 
attgaaacaa tccaaaagtt acgtgatatc actatggacg 
catttaatga ttggggcgag tgtgctacac gcattgtaag 
acattatcaa tttcatcttg ctataagcgg acacgagggc 
actatcaatc caacaatcac gatagaggca caagaccaaa 
tgttagcaag aatgacaata gaagaacatg agcaagacgg 
tgaaccatca aatttattcg agacaaagat aagacactca 
attaatccaa gtattaacga tgttgtacaa gcaattagaa 
tataaaaatt atgaaaatca ctggtagaac acaatacatt 
ggtggggact ttttaggagc tggagaattt acagtaaaag 
acagatactt cacgattgtt tttgaaaaca acgaaggtaa 
attccaacaa gattatcaag aaaaacaata tatcgagtta 
ccagatttaa cttttgacac agatcaatta attaacaaaa 
acgaggaaca aggcaagtat tttgtaagac tctcatatgt 
taataaacca gaacctaaaa ctgatgagat gaaacaaaaa 
atgagtcaac aatcaaaccc attcgctaat gctaatggtc 
tctaggacgt ggtttaaatg caatacatta caagatacca 
tgctactggt gttgaacttg aacaaagtca cattgattta 
gtagaggttc cggacaataa aaaactatct atagaacaac 
tagaacttca ctggggcgaa ccagtagaat caactagaaa 
aggttatgaa gaaatcagtc tgcgtgactg ttcaatgaaa 
tcgtttatgt ttcatcatca aatacctatg agtgtagaaa 
tattatattg ggctacaatc aaccgcaact gtgtaatatg 
tgaagcagtc ggcagaggta tgaacagaaa caagatgaat 
agacaacatc ataatgaaca gcacgcaatt ggtgttaagt 
cgtggataaa agttgatgag aggctcaata aaatgttgaa 
atgactatcc gatacaagta ttaccgaaat tagctgaatt 
acaaattcat tattggctaa acaactcaaa acataaatac 
ccagaatggc aaaaacaatt tccattttgg agcgagagaa 
aacaaaattt attgcatgta ggtaactaca acaaggctgg 
ttatgaaaca ttaaacaaac tagtggcacg accatcggga 
cacgatgcaa gaggacaaaa tgacccgacc aataccatag 
cagacgacgt ctcaaagtca tttaagtata ttagtaccaa 
agaacagtta gaacacgaaa ttaaatcatt taagcaagat 
tactgcaaag aaaacaacaa aggtctgaat tacttactaa 
tttcagataa agaaagtgct gaaaacaaat tgaaacctcg 
catagcacaa atggaaaaag aattgagtga tgactaatgc 
ttaaaaaagt taggtacgta tacaacatcg attttgataa 
aagtcaaaac ggggattatc aaccaactgt aaaagctgta 
ccgcctaacc taccagcaat catgcgtaag gcacctaaaa 
ccgctacaca ccaatggaaa atgcagaatg accccgaata 
cttcatgaat aagttggcag aatttggggg cgataacgaa 
caataatcgc tacgctactt aaacaaccgg acgtactaga 
gaacgaaaag tttaaaacct ttttcaatta tgtaatggac 
ttaaaagcaa ctaaagataa agagttttta gatgcagata 
ttggatacgg attctttgaa cgttatcaac aagaattatt 
attggtaact gagttcaaac aacaacctac gaaccaaaat 
ttaaaaacaa ttactaacag aaaagaagac ggaaccaaga 
acagcgatag ccctaagaag caaattaaga cgggttataa 
gccgtcgcaa ttaatcgtca tcgcagcgcg tccctcagtg 
ctgaacatag cacaaaatgg atacaaaaca tctttcttta 
aacgtatgtt atcaacaatt actggtattg agttaacaaa 
tgacttaaca aagttaacga atgcgatgga taaaatcatg 
aatatcacac cgcaagatgt gcgagcgcaa gcaatgaggc 
attatcttca actgatggat actgatgcga aagttgatag 
cttaaagata atcgctaacg agacaggcgc aatcatcgta 
tctagacagg ataaaagacc aatgctatcg gacatgaaag 
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10011 aatcaggcgg aatagaagca gatgcgagtt 

10081 cgaagatgac agtatcactg gcaaatctat 

10151 ggaataattg aatttgagta ttacaagaag 

10221 cttattgaaa tcgatgtatg aagagacaaa 

10291 ggttgggcgg tcaatagatt gttggacaat 

10361 agaaaatcat gaatgaaatc aactggaaga 

10431 atattactta taccgagaag atggcacgga 

10501 gtttattcgc tcacaggagc ccatttcagc 

10571 ttaaaggcgc tcacgggctt ctatatgagc 

10641 gtggcacaat gagtaaatac aatgctaaga 

10711 gtgcgaatat taccaatatt tagaaagtaa 

10781 aaatttgaat tacaacctaa attcgggaaa 

10851 aggaagggaa actggttgaa gttatagacg 

10921 gatattcaga tatcagtata gagatgtgaa 

10991 gaatggatgg tatatgagga cttagtgaaa 

11061 caacaacaag catatataaa cgcaacaatt 

11131 atgatgtgga taaagaaaaa gatacgctgg 

11201 tgacaacata acaataagac atgcatatat 

11271 aatttaaggg taatcatggc tagagataac 

11341 gatcaactat tagtggatat aaaaacggaa 

11411 agatgctcta ggtgttaatg taagtgaact 

11481 attaaaaaag taaatgtata gaggtggaat 

11551 taccgaacat gaaaatgaat tgataaaaaa 

11621 ggtggctggg cgttgttaga agccttacat 

11691 tgttatccaa aatcatggag cgagagagca 

11761 actacgtaag aagaagccac atttgtttaa 

11831 gtcacttata accaaatgtt caagaaatgg 

11901 tatgaacaaa acgcaagaca atgttaaaca 

11971 tttatcgaac aggttacggc acagtatcca 

12041 tgtctagagc accgttaaag aatggtcatg 

12111 tgacttgtgg gagggttaac gatggcaacg 

12181 tgggattaga agactgtgaa aaatatacag 

12251 tgtgattgaa aactataaga caagcatatg 

12321 cctaattgtt aaaaggagtg atgaccatga 

12391 ctctaagaga tatctgtatc aagacaacga 

12461 tttcacggac attataaaac gatgtttaaa 

12531 tatatataaa gcaacatgat ttggaatatg 

12601 ataatggcaa agattaaaag aaaaaagaag 

12671 ctgaacaagt tgaaagtaaa gtgtttcaat 

12741 tttttcaact gatgggcatg ggttttatac 

12811 acagaggaag tcactgaaga tactgagttt 

12881 tatatgaaaa tgattcaatc agagagttga 

12951 taaaactatg acattaattt ggaaagatgg 

13021 gtatcaagtt ctttgcatct aaatgtgatt 

13091 ggaaaaagcc acgaacatgg tattaaaact 

13161 gtcgaaaaag tggaggcaat ataatgatac 

13231 tggagctgaa aatgttgact ctatcactga 

13301 gtttttaaag acgaacgtga tgagtacaag 

13371 gaaaacgtaa cgaagagctg gagaacatgt 

13441 ttactgtttt aaaattagag aactacaccc 

13511 ggtaaaagca ctgcagatat tatactgtcg 

13581 ttttagggca aatggaggca gacacaaatg 

13651 gtgcgtataa cggtaatgac acagaggggt 

13721 gtttgatgaa atacttgagg gaatgacaaa 

13791 gaagcagtag gggttatggc aggtcaagtt 

13861 gtaggagata aagtatataa ccatgaaaca 

13931 gagatacaca ttataaactg tctgatgatt 

14001 tctaattaag ggggacgagt gagtggaatg 

14071 aataaaaatt taaagtcggt atacgtaaca 

14141 ttgaaatttt taatgaagaa gtgttattaa 

14211 ttggattaat cctaaatctc ataagacgcc 

14281 tttgaatttt tggaggacga gtaaatgctt 

14351 atttaaacga cgactggtgg tacgagctag 

14421 tgatattgat agagtgctta aatttattga 

14491 taacagtaga tcaattaaaa gaacttttac 

14561 tttaaatgac acagtagcta gtatgattat 

14631 aattggaaga aacaaccagg taagccatta 

14701 gtttgcaatt aactctgact attgttgatg 

14771 gattgaaaat gaagttactt tacctaaact 

14841 gaacaatttg taaaaggtat tgataatagt 

14 911 cttactatac aatcgaccaa ctcattgacg 

14981 tggaacagca gacgcaggaa aaggatacgt 

15051 gtgacgcaat acttagtcac aacattcaaa 

15121 gagataatca gacgtttaca gttgttgagg 

15191 agttaagata aggagagatg gagatgccaa 



tagcgatgct actttaccgt gatgattatt ataaccgtga 
tgttgaatgt aacatagcca aaaacaaaga cggcgaaacc 
actcagaggt ttttcacatg aatataatgc aattcaaaag 
gcaaagcgac ccgattgtag caaatgtata tatcgagact 
aacgagttat cgcctttcga tgattacgac agagttgaaa 
aaacacacat taaggagtgt taaaaaatgc cgaaagaaaa 
agatattaag gtcatcaagt ataaagacaa cgtaaatgaa 
gacgaaaaga aaattatgac tgatagtgac ctaaaacgat 
aagagctagg attgcaagca acgatatttg atatttagag 
aagttgagta caaaggaatt gtatttgata gcaaagtaga 
tatgaatggc actaactatg atcgtatcga aatacaaccg 
caaagaccga ttacgtatat agccgatttc tctttgtgga 
ttaaaggtaa ggcgactgaa gttgccaaca tcaaagcgaa 
tttaacgtgg atatgtaaag cgcctaaata cacaggtcaa 
gtcagacgta aaagaaaaag agaaatgaag tgatctaatg 
gatataagaa tacctacaga agttgaatat cagcattacg 
caaagcgctt agatgacaat ccggacgaat tactaaagta 
agaggtggaa taaatgaagt tgaacgaagt attcgcaact 
gtaagtgtcc aagatttgca caatgaaact ggcgtatcaa 
aagctgagat ggttaactta aatgtattag ataaattggc 
atttactaga aatcacaaca cgcacaaatt agaggattgg 
aaatgagtat cgtaaagatt aacggtaaac catataaatt 
gaacggttta actccaggaa tggttgcaaa aagagtacga 
gcaccttatg gtatgcgctt agctgagtat aaagaaattg 
aagagcgtga aatggttagg caacgacgta aagaggctga 
tgtgcctcaa aaacattctc gtgatccgta ctggttcgat 
agtgaagcat aatgagcata atcagtaaca gaaaagtaga 
accggcgcat tacacatacg gcaacattga aattatagat 
cctcaactag cattcgcaat aggtaatgca atcaaatact 
aggatttagc aaaggcgaag ttttacgtcc aaagagcttt 
caaaaacaag ttgattacgt aatgtcatta caggaacaat 
acgaacaagt taaagctatg agtcataaag aagttagcaa 
ggatgaagag ctatataacg aatgcatgtc gtttggtctg 
acgatagcgc acgcaaagaa tacttaaacc aatttttcag 
gcgagtggca catatccatg tagtaaatgg cacttattac 
ggcgtgaaaa agacatttga tactgctgaa gagctcgaaa 
aggaacagaa gcaaccaact ttattttaga ggagatggaa 
atgacgctac tcgaactggt ggaatgggca tggaacaatc 
cagatagaat gggcacgctt ggagaatgta gcgaagtaca 
aaaagtagta acagataaag atatttttac tgtagaaatc 
gattgtctag tagaactaaa cgatattgaa ggttttgaaa 
tagacggtac ttccagagcg ttttatatac taaacgaaga 
ggagttggta gtatgatgca aacctataaa gtatgtcttt 
ataaattaaa gaaacattat ttcgtgaaaa gtacgaatga 
gattcgtaaa aagctcccgt tcgaaactgc aagcatagaa 
aaccaacaag agaagaatta attaatttca tgaaaaaaca 
tgagcaaagt gcaataagac actttagagc tcaatcaaaa 
aagcaacgag atgagcttat cgaggatata gctaagttaa 
ggcgcacagt caaaaatgaa ttgcttggaa gatacgaaca 
tgagagcaaa gcgaacagga taggagctct ctatatagga 
cgaatggaag aactagacgg aacaaatgag ttctacgaat 
aataaccgtg aacaaataga acaatcagtg atcagtacta 
tactaaaaga gattgaggac gtgtataaga aagcgcaagc 
tgctattcaa cattcagtta aagaaggtat tgaacttgat 
gtctataaat atgaggagga gcaggaaaat gagtattagt 
aacgaaagtc tagagattgt gcaattggtc ggagatatta 
cagttattag cattatagat tttattacta aaccaattta 
gaaacgatta aaaaatgtgg tgccgcaccc agttatcaaa 
aaagataatg tgaaagaggt tcaaaaagaa ttaggtttct 
ctggattttt atcatttcaa aggataccta tttacattat 
tagatattac tttgctaacg agcatgagat tgaaagatat 
gaaatcatcg accaacgtga tgcattgcta gaagaaaagt 
attattggtt gaataaacgc aagtcagaaa atgaacagat 
ggaattaaaa cgataggaga taacgaataa atgaataatt 
aaatacaaaa ggagttcgac gatagaatac cgactagaaa 
tgaatttgcg gagtgggtta acacacttga gttttttaaa 
gatacacaat tagatgagat tgctgattac ttagctttca 
aagaagattt ggaagagact actgaggtta tggttgattt 
acattcagtt tattttgttc atgtaatgca tacactaaca 
attgtacaag ttttaataat gccttttttg tacgccaata 
catacaaaaa gaaaatgaaa aggaaccacg aaagacaaga 
gtaaagacat cttagatcga gtcaaggagg ttttggggaa 
gattcaacag gacaaccaca tgaacatttt actgctgcta 
cggagagtaa agaaggagcg aaagagaagt acgagaaaca 
agaaaacggt aacgattgat gtagatgaaa acttattagt 
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15261 agtagctagt aatgaaatat cagaactatt 

15331 ggcgataata gagatatcga aaaaaaaaga 

15401 catgtcgagg aggcagacga tgattaacat 

15471 aagaaatata aaaataaaac acctgaagaa 

15541 ataaagacag tgaattttac agtcctatga 

15611 aatgatgcct agtttaattg atactggaga 

15681 cgatatcttt attgctggaa tactgcgatt 

15751 atctatacag tggctagtta ccaaaacaaa 

15821 gacaagataa agaagacaag ttctatattg 

15891 attcaaaaag aaatttgata gcgcagacat 

15961 acgattggtt atagaataca ctttttaaat 

16031 aatgattaaa caaatattaa gactattatt 

16101 aaagtatata ttatgacgac ggctaatgat 

16171 agtctgattt gatgagggcg gaggtgtcag 

16241 cacacataaa atgaagtgta atgtattagc 

16311 tatggcatac aagcaacgtt gccgaaacca 

16381 ggcttgagag agcaaataaa aggtatgctc 

16451 gagattggga cacgttgact tttgcttctt 

16521 aagaagatgc ctaactctaa attaaataga 

16591 tttatctact acagtgacga aaatgacaaa 

16661 gaattaattt tatataattg atttgtaaga 

16731 tcatcgatgt gattgagaag tgacaaacat 

16801 gcctatatgg tgggtagttt aattcttgca 

16871 acgtagctcc tgcacaagat gtaggggcat 

16941 tgtgatacag tgaaaacaat tgattaaatt 

17011 ggcaagagcc aacggaaagc atatattgac 

17081 ataaagaagc gagtacactt tttaaaaatc 

17151 agctgaacaa tcaaaatgga cacgccaaaa 

17221 aatgacattg aaatagaggg agtgaagaaa 

17291 atagaatgac gttaggtaac gaagttttag 

17361 gaagattgaa caaatagata aaggtgacag 

17431 acggaagtga tcgtcaatga ataaacttaa 

17501 acgcaaaaac aagattggtt tatgttaatt 

17571 atgacttatt tttacgtgag ttaatgcgtg 

17641 atatatactt gctggtgcaa cattaggtac 

17711 ggcattgagt ttaattttga taaatataat 

17781 acagtaaagt aagtggtata ggagctatac 

17851 gttagcgcat gaagaggtgt ttgacgagat 

17921 gataccaacc ctgaccatcc cgagcattgg 

17991 gt at act gag tcaccaattt aagctcgatg 

18061 ggcttcaaca ccatcaggta tgttctatga 

18131 gtatatgccg actttgattt gaatgagaat 

18201 aatactttgc tggtgtcgac tggggttacg 

18271 tggtaacttt tattttattg aggagcacgc 

18341 aaagatattg taagtagata tggcaatatt 

18411 aatttagaag acatagatta cgtgcaatta 

18481 taagttgttc aaacaaaaca agttacttgt 

18551 aaatatgttt ggcaccctac aaacggagag 

18621 atgccatata cacacatact aaacctgaac 

18691 gatattgaag cacaaggaat attgcctaag 

18761 agagaatggt taatctctat aatagataca 

18831 aattgaagaa aaagaagatt ttgaaactgg 

18901 cttaacaact cttttgacag cgaaattgtt 

18971 atgatttaga tgaaaacgca gaaaaaaacg 

19041 tagtgttgat gatgaggatt ctgaaatagg 

19111 tatattgata cgaatggtga tattaggatt 

19181 atattttaga acctacatac tcattgcgct 

19251 gtacgcagag ttttacgata atgcttatta 

19321 gttggacgat atgaacattt atttgattac 

19391 gagatgctga aaaggttatt cacttaattg 

19461 tagtcagaca cgtttagcat accttgtgtt 

19531 caaaagagtg gcgcatttga gttgttcgac 

19601 acacaatgat tgagaaccat ttagatcgaa 

19671 taattctgac gagtttaacg gaaatgtacc 

19741 aagtgtatga cgtttgagcg taagatgaca 

19811 taaagcgtaa agggtacaac ttggatgatg 

19881 tccagttaat aagttagaag aatcacaagt 

19951 ttaggacaat cacaactagt tgatgatgtt 

20021 ttaatgacaa attacctgac atagatgaag 

20091 atattgatga gtatatcgag ggtttaatct 

20161 acttaaagag ataaaacaaa tcatcgcaga 

20231 tggactgaat tcaataaata caacaggctc 

20301 actataggca agtagctaag atgattcaga 

20371 cctttattta tatgaaatgg cgagtcaaac 

20441 tcagctattg aacaacctat tgagttcatt 



atatgaatat gacagtgagt taatgtcagc tgatgaagat 
gacgcattaa aacaagctat acaaattatc gataaattaa 
acctaaaatg aaattcccga aaaagtacac tgaaataatc 
aaagctaaga ttgaagatga tttcattaaa gaaattaatg 
tggctaatat gaatgaacat gaattaaggg ctatgttaag 
tggcaatgat gattaaaaaa cttaaaaata tggattggtt 
attcggcgta atcgcactga tgcttgttgt catatcgcct 
gaagtatatc aagggacaat tacagataaa tataacaaga 
tgttagacaa caagcaagtc atcgaaaact ctgacttact 
acaagctagg ttaaaagtag gcgacaaagt agaagttaaa 
ttatatccgg tcttatacga agtaaagaag gtagataaat 
cttactagcg atgtatgagc taggtaagta tgtaactgag 
gatgtagagg cgccgagtga cttcgcaaag ttgagcgatc 
agtagatgta tagcaaagag tcaattgtta atatgatagg 
tgatgtaata ccggaatatg atagcaattc aattgcacag 
caaggggaaa actcaagtaa agttgaagat gttgttgtga 
agatgttaaa agaggttgag tttataaatc aatcgcaaca 
agagttattg aagaaaggtt ataacaggga tgcgattatc 
aacaacttct tagcgcgccg tgatgagtta gcagaaaaga 
aatgacagaa atgacgaaaa tgacactatt tttaaactgt 
attatcttaa gacgtggggt aatagccaca ttagatgttc 
ataaaagatg atatgttacg ctattaatca cctactacct 
ttttgagtca taactatttt cctcctttca catttattga 
tttttatatt taaataacta gagtaattaa cgtaaaggcg 
aacaccgaag caagaaaagt ttgtgctagg actcatagag 
gcagggtatt cgactaaagg taagagtggg gaatatctag 
ggaaggtttc cggaaggtac gaaaaattgc gtcaagaagt 
ggcctttgaa gaatatgagt ggctaaagaa tgtagctaag 
gcgacagctg atgcattcct cgctagttta gatggtatga 
ctaaaaagaa aatagaaact gaaattaaga tgcttgagaa 
tggaacagaa gataaaatca aacaacttca cgacgcaata 
atctttatat acggacaaac aaattgaaat attgaagcaa 
aatcacggag caaagcgtac aggtaaaaca atattaaaca 
tgcgaaagat agcagacgaa gaaggaattg agacacctca 
gattcaaaaa aacgtactaa tagagttaac taacaaatat 
tcattcatgt tatttggcgt tcaagtggtt cagacaggtc 
gtggtatgac atcgtttggt gcatatatca atgaagcgtc 
taagtcacgt tgtagtggaa ctggtgcaag aatattggta 
ttgttgaaag attatattga aaatacagat cctaaagcag 
acaataactt tcttaatgat agatataaag agtctattaa 
acgtaatatc aacggtatgt gggtgtctgg tgacggtgta 
acgattaaag cagatgaact ggacgacata cctatcaaag 
agcactatgg atctattgtg ttaataggac gaggtataga 
acaccaattt aagtttattg atgattgggt ggttattgca 
aatttttact gcgatactgc acgacctgaa tacatcactg 
acgctgataa aagtaaacta tcgggtgtgg aggaagttgc 
tctttatgat aatatggata ggtttaagca agaggtattt 
cctataaaag aatttgatga cgtgttggac tcgttaagat 
gattaaggag ggggaaatga cattgtataa gttaatagat 
catattgagg ctctaataga gtcacataaa gacgatagag 
agacacatat tgactatgta ccaatattca aacgtcgacc 
tggaaatgta aggcgattag acgtgtctgt taataacaaa 
gatacacgtg ttggttattt acatggtgtt cctgttactt 
aaaagttgaa aaagtttata accaactttg ccattagaaa 
taaaatggca gcaatttgcg gatatggtgc taggttagca 
aagaatatag atccctataa tgttattttt gttggcgaca 
acttttatga aaaagatgat gataatggca ctgattatgt 
ttatgtattt cgaggagaag gtattgacgc tttgcaagaa 
aatccattgt ttggtgtacc taacaacaaa gagatgatag 
acgcatatga tttaacaatg agcgatgcat caagtgagat 
acgcggtatg ggtatgagtg aagaaatgat tcaagaaaca 
aaagatatgg acgttaaata cttaacaaaa gatgtaaatg 
tcgaaaagaa tatcatgcgt tttgcaaagt cagtaaactt 
tatcattgga atgaaactta aacttatggc tttagagaac 
gctatgttga ggtatcaatt caaagttatt ttatctgcat 
atagttattt aaacctgata tttaagttca ctcgtaacat 
gctaattaac ctgaagggac aagtttcaga acgaacaagg 
gattacgaat tagacgaaat ggaaaaagaa agtcttgaat 
gtgacgcaaa tgacaaatcc caaaataacc aatcagaatg 
ctaaagcaga aaaaccaata gaacaactat ttgctaatcg 
tatgtttgag aaatatcaaa atgatgatgt gtatgttaca 
aataaggagt taactcgtat aggtacaatg ttgacttatg 
agtcacaaga agatgcttat atagaaaaat tccttatgag 
atctatgcag tttgatgttc cgagtaaaga ggtaatcaaa 
cgtttaatgc caacactaca aaaacatcgt gatgaagtat 
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20511 tgaaaaagat acgtatgcac attacacaag 

20581 aatacgtgat gatgtcggca tgtctaaagc 

20651 atgtcacaag ctggacttga tagcgcaatg 

2 0721 atgctactaa agatacacga acacgtgata 

20791 gaattttaaa tcaagtgggt gtgttgggca 

20861 aatattaatt gtcgttgcaa attactttat 

20931 gtaaagacga tggtaaaaat gaagttatcc 

21001 aggtggtaat tgatatggat tttaaaataa 

21071 acgcattaaa tccttgtacg aagagataat 

21141 aaaaatgaag ctgatttaga tatggttaaa 

21211 cactttttta gttgtctctt tgctactcga 

21281 cggactgtta gggtacgcga agggcaaaaa 

21351 tttgaagaac acaaagacga taaagaagta 

21421 acgttaaagg ctttttagat acagaagaag 

21491 gaaaggatta gaatcatgga aagagaaaaa 

21561 cctgagcaat cagaagaaca aaaacgtatt 

21631 caaaacgtga gaagttaaga agtaacgcgc 

21701 tgatagattt ttaggcgatt ctgatgaaga 

21771 aagtatgttc aaaaaggcgt tgagtctaaa 

21841 aagatttaga cccttcaaat gtaaagtcca 

21911 tgaggtaata aaatatggca actccaacat 

21981 cgttattcca gcagaacaag gtactttaat 

22051 gctaaaaatg agccaatgac agcacaaaag 

22121 gggtatcaga aacggaacgt attcaaactt 

22191 aattggtgta attattccgt tatcaaaaga 

22261 aaacctctaa ttgcagaggc attttacaaa 

22331 acaacacttc aactagtggt aaaccgcttg 

224 01 taataattta tacgtagacc tttcggcatt 

224 71 gtattaacta cacgttcatt cagaagtaaa 

22541 atgctaacgg gaacgagatt atgggattac 

22611 atcgttagca ctaatgggtg attgggatta 

22681 tctgaagatg ccacgttaac gacgttacaa 

22751 gtgatatgtt cgctttacgt gcgacgatgc 

22821 gcttaaacca actgaatagg aggagatatg 

22891 atatgactat tactgttaca aagaaggcat 

22961 atcacgtcgt actacgtctg ataagagcga 

23031 tcaagaaaaa tggagtggaa gttaaaagtg 

23101 agaagatgtt agggatataa caaacaatga 

23171 aaaaagtatg tcgcagatgt cctagagtat 

23241 gtatggggac agtgtcgtac acttataacg 

23311 taaacgagca aagtttcatc cgtttaaacc 

23381 aattccctca cactatttct attggaagta 

23451 tgtaagcgat aaaacaatta aaggatttat 

23521 atgtcacaag aatatgacag aaacctatat 

23591 agtatgaggg tagaatcttt agtattgaag 

23661 actacgactt aagcaggtgc catatggcaa 

23731 taagttcgat aagaaaatag aagagtgggt 

23801 actgctgtag cattagctcc tgttgactta 

23871 gtgggttatc cagtgttata agtgtcggcg 

23941 tgctactggt cctggtggta gtcgtgctac 

24011 tacaccacat atggtcaagc gccacagcca 

24081 agcagtattt ttcatagagg tggttaaata 

24151 taaaagatta atctcagacc ctaacattaa 

24221 gacgctgttt acccatatat tgttgtgggt 

24291 gagaaacagt cggtattgtc atacatgtgt 

24361 aagcgcgata ggttatgtgc ttaacagacc 

24431 gatagtcaag cagtattccc tgatatagac 

24501 acagacataa aaagaaaaac gaaggagtgt 

24571 agctgaaact gactt agate cagtagaatc 

24641 gaaaatgatt tagctgaaat agtacgaggc 

24711 ttaaattaac aattggtaat gtgcctggag 

24781 tggacagttg cgtatatggc tttatgagcg 

24851 tatgttgttc cagaatcatt tgaaatgtca 

24921 ttaaatggaa tacagcagaa ggtgctgaag 

24991 tacagttgaa tacgaaaaat teggegaaaa 

25061 tctgattcac acaeggaaga ccattctatg 

25131 tataaaaaaa ttgaaaagag gtatatattt 

25201 eggagaaaaa gattacgaag tagaagcaaa 

25271 gaagatagcg aagatgggag aaaaggagca 

25341 ctagaaacaa agegatttta caattttggg 

25411 acaattagaa aaagcaattg atgatttcat 

25481 ttggacaaac ttaacaatag tggttttttc 

25551 caccgaatat ggecaaaage gaggacaaag 

25621 caaggaaatc atgggcgcag aaccttacac 

25691 gatatatccc tgaacatgaa ttgttagcac 



gt at tat gag tggagagggt tactctaaga tagctaaagc 
tcaatcattg cgtgtggctc gtacagaagc aggcagagca 
gttgctaaag ataacggttt gaatatgaag aaacgttggc 
ctcatcgtca tttagatggg gaatcagtgg aaatagatca 
ggcgcccaag ctatttattg gtgtaaacag tgegaaagag 
tatattgatg aaaatgaatt gccaactgta atgagagcac 
cattcatgac ttatcgtgag tgggagaaat ataagegaaa 
aagtaaatgt tgatactggc gaagctatag aaaagttaga 
agagttacaa aacgaaaaag ttgttgtaaa cgtaacagtt 
acatctatta gcgaagaaaa tgctaaaaat aatgatttca 
ccttagcatg tegttaaact gctttttatt atgcactttt 
ggagttttga tatatgaata tcgaagaagt taagtctttt 
aaagattatc taaagggact taagacggtg tctgttgatg 
gtaaacgatt cattcaacct gaattagatc gttatcattc 
tcttgaggat ctaatcgaac aagaagtacg gaagcgtaat 
agtgctcttg aacaagagtt agaaaaaege gaegcagagg 
taggtaaagc gcaggaacta aatttaccaa catccttagt 
tactgagcaa aacttaaaag ctttaaaaga aacctttgac 
tttaaatcga gtggaagaga tgttaaagaa tcacgaaatc 
ttgaagaaat ggcgaaagaa atcaatatta gaaaataaag 
acacgccagg caatgttatt ttateggatt ttaaaaaegg 
catgaaagac attatggcta attcagcaat tatgaaatta 
aaaaaattta cttacttagc aaaaggtgta ggcgcctact 
etaagectga atatgegcaa gcagaaatgg aagctaagaa 
gtttcttaaa tggactgcaa aagatttctt taatgaggtt 
gcgtttgacc aagctgttat ctttggtact aaatcacctt 
ttgaaggege agaagagaaa ggtaacgttg ttacagatac 
aatggctact attgaagatg aagagttaga tecaaaegga 
atgcgtaatg etttagatge taatgacaga ccattatttg 
cactatctta tactggagcg gatgtatacg acaaaaagaa 
cgcacgttac ggtatcttac aaggtattga gtatgeaatt 
gcatcagatg cttctggcca accagtatca ttatttgaac 
atattgeata catgaaegtt aaaccagaag cgttcgcaac 
atggctaatc ctgeagaaga gattaaggta aaaaaagaca 
ttgactctta ttacagtctt gtcggttaca aagaggttaa 
gtgataaaaa tgactcttta tgaagatgtt aaacttttac 
atgaagaaga aatatttaag atggaagttg aeggaatact 
ttttatgaaa gatggtcaag tcatttatcc ttactcaatc 
tatcaacgac ctgaagttaa aaagaattta aagtcaagaa 
atggtgtccc tgattacatt agtggagtat taaacaggta 
aataaggtag aggtgttgtt tgtgtttaac ccatacgacg 
tcaaaaaagt aggagagtat ccaattatac aagagegett 
ggatacgect actacatctg aacaactaaa atttcatcaa 
gtaccttatg acttgecaat atctaaaaac aatttatttg 
gtgattctgt agatcagggc ggacaacatg aaattaagtt 
aagttaagta eggtgetgat agcatggttg ttgaattgga 
taaaaaaggt attgetaaaa caacgacgaa gatttacaac 
ggttttttag aagaaagtat tgactttaaa tatttcgatg 
cagattatgc aatatacgtt gaatacggta ctggtatata 
aaagattccg tggagtttta aaggtgatga eggegaatgg 
ttttggaacc ctgcaattga cgcaggacgc aagacattcg 
tgtgggtatc agttgagcct gaacttacaa atcaaatata 
caaactagtt gatgataggg tttttgacgt tgttcaagat 
gaatcaaacg tcactaacaa cgaatctagc gcaacaatga 
attcacagtt cgctacacaa tacgaggcta agctcatttt 
tatagaaata gataattacg agtttcaatt tagcegtate 
aggtttacta agcatggcac gataeggett ttatttaagt 
attaaatggc gcaaaaaaac tatttagcag ttgtacgtcc 
tttattatta gctgacttac aagaaggtgg acatacgatt 
ggtaaaaegg actattctcc caatgeaatg tcagaatcat 
ataaaggaat tgaagcagtg aaacacgctg tacaaacagg 
taataaacgt geagaeggta aacatcaegg aatgtttggt 
tttgatgatg aaagtgacaa aatcgaacta tcattaaaag 
ataacttgee gaaagagtgg tttgaagctg caggtgcgcc 
agteggaaca ttcgagaatc aaaagaaagc tagtgttgta 
taaactaata gatcaagggg gegtaagetc cctatttttt 
tgactgaatt taatccaatt acaacattaa aaattaatga 
agtaacattt gcatttgacc gaaaagctga aaaattctca 
atgecaggat tcaatgttat etttaaeggt ttgctagaat 
aatgtgctac tgcttattta aaaaacccac caactcgaga 
cactgaaaac gaggatactt tgccgttatt acaaggggct 
aagagggaga gtcgctcgta ctggatgaca ttgaacaaag 
aaatgacgaa agcaggcata gaaatgatga aagagaatta 
gattactcaa aaataaggca actgacagct agatatttag 
taacacctgc tgaatggcgt gattggctta ttggtggtca 
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25761 ggataggtac ctagatcaaa gacaattatt 

25831 aagaggctaa ctagtatgat tcgtgacatt 

25901 ctcgtgtaca aaaagctaga ttagaagaag 

25971 attccttgaa tcgaaaggag gttagccttt 

26041 gatttccaaa gcaacgtaag gaaagctcaa 

26111 atgtaaaagc agatatttca agattccaaa 

26181 agagcattct gttaaattat tcatgaaaac 

26251 gtagagcgat ttaaacaaca taaagtagat 

26321 caactaaagc tactgtcgaa gcttggagaa 

26391 aatggcggtt aaagggttta aagaagattt 

26461 agatggaaat taggaaataa attcacaaaa 

26531 gaagaattgg tcagattatg agaaaagaag 

26601 attgaaagat tacggcgaga aaatggacgc 

26671 caacaggtca aaggcttaat gattgctagt 

26741 caataatggc agtacttaat gcggttggtg 

26811 tgtcgcaggt cttggagttg ttggctttgg 

26881 acattggcag taacaaaaga agttcaaaac 

26951 atattgttaa agagaatcaa gcaagtatct 

27021 gatgtctcaa ttaaaaccat tcttatccga 

27091 aattgggtta aacattccga aacagctaag 

27161 tcggagattt attgaacgct gcaggacgat 

27231 gttgttcaaa tttgtgtctc aaggactaca 

27301 gctggtcaga atgctattaa agcgtttatt 

27371 ttggtaatgt gttcgctggt attggtaatt 

27441 ttggttggtt aaattaactt ctcaatttag 

27511 gactttatca gttatgttca agagaatggt 

27581 tagttgcttt tggtactgca atggctccta 

27651 atttatcgct aaactattcg aaacacaccc 

27721 ggtgtatttt gggctttaat ggctccgatt 

27791 gcttattcag cgtcactgaa aagattttag 

27861 agcattaata ggtgcattcg gttcgatttc 

27931 attggtgtcc tcgtttattt atggaaaaca 

28001 gtgttaaaac ggcagtttct ggtgcgattc 

28071 ccaatctacc ttacaaccga taatgcctat 

2 8141 gttttggtaa taggcatcat tacaaacgtt 

28211 cgttccaagc cataggaaca gtgatatccg 

28281 tcagttgctt actggcgact tctcaggtgc 

2 8351 acgatttggc aatacatgca atcagtttgg 

28421 cactttctat gtttggtaca agttggtcac 

28491 gaacactgtt acaagttggt tcagtcgagt 

28561 tttattatca caaaaggttc tgaatgggtt 

28631 tagctgatgg gtttaaaaga gttgtctcaa 

28701 aagtttcttc agtgatttct taaatgccgg 

28771 tctgcgcaca aagtagtcag cgcggtaggc 

28841 taagtggaca cggtggaggt agtagcttag 

28911 agactttggc agtgccttta ataaagagct 

28981 acttctatag acagacacat gactagcgat 

29051 atgtaacgat tagaaatgag ggcgaccttg 

29121 cggaagtttc aacttattat aagggaggtt 

29191 gttcacagta tcgcgtcagt gacaatcctt 

29261 aggcgcagga tatcatcgta actattctga 

29331 gaagaactta aaaaagtaga gcttaagata 

29401 agtcagacgt ccaagcacta tttgctggac 

29471 taagtatgag catatattag atataccaaa 

29541 caactttttg taggactagt aagtgaagtt 

29611 cgtttgaaac aaccgaacta ccatactttg 

29681 ccctgaaaaa tggtcggtac ctgatagatt 

29751 tacaacacta actcaggaga agtttattat 

29821 ttgttgaaat agagttagct gaagatgtta 

29891 aggaaatatc tcagttatta aggaagttga 

29961 acctatagag gttatttaaa tatagattct 

30031 ggaatcgatt caagtctaat aaagtaatga 

30101 agtagcctat gccaatttta ttaaaaagtc 

30171 aagtaaaaag ctaaatgaag atagttcttt 

30241 ataggtgcta taactaaaat gtggacgatc 

30311 ttgtcatact tgataagtct actattggcg 

30381 tgatgacctt aacaattcta ggatttacca 

30451 actgtcttta aaggaacggg ttataagtat 

30521 taggcaaagg agatacacga ttagaaatct 

30591 cgatgcaaag actaaaacgt ttcatttgta 

30661 ggtgtgaatg ctgataacgt caaaatacaa 

30731 gtgattttga tggacaacag acttttgcag 

30801 attgataggt aaaagagaag cgccaccgct 

30871 gcaatggagt tattgataaa gaaaagtgtc 

30941 atttcccaga agctaaccct aaaataggtg 



aattgaacaa gcgcaagcta acggcttagt acaagcttct 
gagaaacaac gttacgaaat aagagaacct ggtagctatg 
aaaaaagaag acgtgaactc ttcaaagaag gtacaagaaa 
ggatactcat tttatggcaa agattatggc caatattaga 
cgattagcaa agacgtctgt accaaacgaa attgaaacag 
gagctttaca acgcgctaaa tcaatggctc aacgatggcg 
agatgagtat aaagcgaatt tagaacgcgc taaagctcaa 
ttgaaactaa gtaacactga attaatggcc aaatataatg 
aacatgttgt taagttggat ttagatgcaa accccgctaa 
aatagatctt agcaggcata gttttgatat tgattccagc 
gaattcaatg aagtcgaagg agcagttaaa cgttctttcg 
taaatggaac aagtgatatt tggggtaaac ttaacaactc 
cttagctact aaaatccgaa ctttcggtac tatcttcgcg 
atacaagcat tgataccagt gattgccgga ttagtacctg 
tattaggtgg tggcgtttta ggtttagttg gcgcattctc 
tgcaatggct attagcgctc ttaaaatggt tgaagatgga 
tttagagatg cgagcgatca gttaaaaact acatggcgtg 
ttaatgcgat gtcagcaggt atcagaggcg ttacaagtgc 
agtatctatg ctagttgaag caaacgcacg cgagtttgag 
aaagcgtttg aagcattgaa tagcataggt ggcgcaatct 
ttggcgacgg attagttaac attttcactc aattaatgcc 
gaacatgtct atagctttcc aaaattgggc taatagtgta 
gactacacta ccactaactt acctaagatt ggtcagatat 
taatgattgc ttttgcacaa aacagttcca acatttttga 
agcatggtca gaacaagtag gacaatcaca agggtttaaa 
cctactatta tgcagttaat cggtaatatc gtaaaagcat 
tagctagtaa attgttagac tttatcacta atctagctgg 
agctatagca caagttgctg gcgttatggg tattttaggc 
gttgctataa gtagtgtact tacaaatgtg tttggtttga 
acttcgttag aacatcaagt ttagttactg gagctacgga 
agcacctatt ttagcagttg ttgcagtaat tggtgcattc 
aacgagaact ttagaaatac tattactgaa gcgtggaacg 
aaggtgtagt cggctggtta actgaattgt ggggcaaaat 
attgcaagta ttaggacaaa tattcatgca agttttaggt 
atgaatatca tacaaggttt gtggacttta attacaattg 
tagcagtcca aatcatagta ggtttgttca ctgctttaat 
ttgggagact attaaaacta cggttaccaa tgtgcttgat 
gagtcaatta tcggcttttt aactggcgta atgaatcgaa 
agatatggag tacaatcact aattttgtta gcagtatttg 
ggcttcgagt gtagctgaaa aaatggggca agcactaaac 
tctaacattt ggaatacagt tacaagtttc gcgagtaaag 
atgtaggtga cggtatgagt gatgcacttg gtaagattaa 
agcggaatta atcggcaaag tagctgaggg tgtagccaaa 
gatgcgattt catcagcttg ggactctgta acttcattcg 
gtaaaggttt agcggtatca caagcaaaag taattgctac 
atcctctact ttgacagata gtatagtaaa tcctgtaagt 
gttcaacata gcttaaaaga aaataataga cctattgtga 
atttaattaa atcacgcatt gatgacatga acgctataga 
tgttagttga tagcgcacga tatagaagta ataaggaatg 
tcacttataa tcacttggaa gtagttgaat ataacgttac 
tatagagggt attgatggta gatttcataa ttacgctaaa 
aggtataaag tacctaaaat tgcttatgct tcacatttaa 
gtttttattt aagggaatta gctacaccag acaattcaat 
agacaaacaa gcatttgagc ttgattatgt tgatggacga 
tcttttgaca caacacaaac atcaggggaa ttttctttgt 
aaagtgtcgg ttatagtact gatcttgaaa gtaataacga 
gcctacaaac gaaggtgata agaggcgtca aatgacattt 
aacggtgatg ttcctttaac acagtttaat cagtttaatg 
aagctaatga taaggatgga ttcactttct atacagataa 
tttaaaagcc ggagataaaa taatcttcga cggtaaacat 
tttaataaaa ctttagaaca accggtttta tatccaggct 
aacaaattac atttagacac aaattatatt ttagataagg 
tacagggtgt agggcacgct attaatgtta gtacaaaggt 
ggatctaact attatcgaga acgcgagtac gtttgacgca 
actcatgttg aaggtgaaga tgatttcaac gaatatgtaa 
aaaaaataag gcttgatatc aaagctaggc aaaaagaact 
agagtataac gaaagtttta caggcgttga gttcttcaat 
gtattacatc caaaagtaga tgcatctaaa ttcgagggat 
ttaaaaaagg acttgagcgt tatcatctcg aatatgaata 
tgatgaatta tctaagtttg ccaattatta cattaaagct 
gaagatgcat ctaaatgtta tacctttatt aaaggttatg 
aagcgggact acaaattgaa ttcactcatc cattagcaca 
tgttgatgga cgtattaaaa aagaagatag tttaaaaaaa 
actgcttcta tttccttaga ctttgtagcg ttacgtgaac 
atgttgttag agtggtggat tctgccatag gatataacga 
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31011 cttagtgaga atagtcgaaa tcactacaca tagagatgcg tacaataata tcactaagca agatgtagta 

31081 ttaggagact ttacaaggcg taatcgttat aacaaagcag ttcatgatgc tgcaaattat gttaaaagcg 

31151 taaaatctac aaaatccgac ccatctaaag aactaaaagc attaaacgca aaagttaacg caagtttatc 

31221 tataaataat gaattggtta agcagaatga aaaaataaac gctaaagtcg ataagatgaa tactaaaaca 

31291 gttacaactg ctaatggtac gatcatgtac gactttacta gtcaatcaag tataagaaac atcaaatcaa 

31361 ttggaacgat tggcgactct gtagctagag ggtcgcacgc aaaaactaat ttcacagaaa tgttaggcaa 

31431 gaaattgaaa gctaaaacga ctaatcttgc aagaggtggc gcaacaatgg caacagttcc aataggtaaa 

31501 gaagcggtag aaaacagcat ttatagacaa gcagagcaaa taagaggaga cctaatcata ttacaaggca 

31571 ctgatgatga ctggttacac ggttattggg caggcgtacc gataggcact gataaaacgg atacaaaaac 

31641 gttttacggt gccttttgtt ctgcaattga agttattaga aagaataatc cagattcaaa aatactagtg 

31711 atgacagcta caagacaatg ccctatgagt ggtacaacaa tacgccgtaa agacacggac aaaaacaaac 

31781 tagggttaac acttgaggac tatgtaaacg ctcaaatatt agcttgtagt gagttagatg taccagtgtt 

31851 tgacgcatat cacacagatt actttaagcc atacaatcca gcttttagga aagcgagcat ggaggacggc 

31921 ttacacccta acgaaaaagg tcacgaggtt attatgtacg agttaatcaa ggattattac agtttttacg 

31991 actaaaggag gcaaccaatg gcttacggat taattacaag tttacattca atgacaggtc ggaaaatagt 

32061 tgctcaacat gagtataact atcgcttgtt agatgaaggt atgagcaaac ttgagaaaat gtttatatac 

32131 catcaaaaag aagaaatata cgcacactca gcgaaacaaa ttaaatactt gaatgacagt gttgaagatt 

32201 atttaacgta tttaaatagc cgttttagca atatgattct aggccataac ggcgacggta tcaatgaagt 

32271 aaaagacgcg cgtattgata atacaggtta tggtcataag acattgcaag atcgtttgta tcatgattat 

32341 tcaacactag atgctttcac taaaaaggtt gagaaagctg tagatgaaca ctataaagaa tatcgagcga 

32411 cagaataccg attcgaacca aaagagcaag aaccggaatt tatcactgat ttatcgccat atacaaatgc 

32481 agtaatgcaa tcattttggg tagaccctag aacgaaaatt atttatatga cgcaagctcg tccaggtaat 

32551 cattacatgt tatctagatt gaagcccaac ggacaattta ttgatagatt gcttgttaaa aacggcggtc 

32621 acggtacaca caatgcgtat agatacattg atggagaatt atggatttat tcagctgtat tggacagtaa 

32691 caaaaacaac aagtttgtac gtttccaata tagaactgga gaaataactt atggtaatga aatgcaagat 

32761 gtcatgccga atatatttaa cgacagatat acgtcagcga tttataatcc tatagaaaat ttaatgattt 

32831 tcagacgtga atataaagct tctgaaagac aagctaagaa ttcattgaat ttcattgaag taagaagtgc 

32901 tgacgatatt gataaaggta tagacaaagt attgtatcaa atggatatac ctatggaata cacttcagat 

32971 acacaaccta tgcaaggtat cacttatgat gcaggtatct tatattggta tacaggtgat tcgaatacag 

33041 ccaaccctaa ctacttacaa ggtttcgata taaaaacaaa agaattgtta tttaaacgac gtatcgatat 

33111 tggcggtgtg aataataact ttaaaggaga cttccaagaa gctgagggtc tagatatgta ttacgatcta 

33181 gaaacaggac gtaaagcact tttaataggg gtaactattg gacctggtaa taacagacat cactcaattt 

33251 attctatcgg ccaaagaggt gttaaccaat tcttaaaaaa cattgcacct caagtatcga tgactgattc 

33321 aggtggacgt gttaaaccgt taccaataca gaacccagca tatctaagtg atattacgga agttggtcat 

33391 tactatatct atacgcaaga cacacaaaat gcattagatt tcccgttacc gaaagcgttt agagatgcag 

33461 ggtggttctt ggatgtactg cctggacact ataatggtgc tctaagacaa gtacttacca gaaacagcac 

33531 aggtagaaat atgcttaaat tcgaacgtgt cattgacatt ttcaataaga aaaacaacgg agcatggaat 

33601 ttctgtccgc aaaacgccgg ttattgggaa catatcccta agagtattac aaaattatca gatttaaaaa 

33671 tcgttggttt agatttctat atcactactg aagaatcaaa acgatttact gattttccta aagactttaa 

33741 aggtattgca ggttggatat tagaagtaaa atcgaataca ccaggtaaca caacacaagt attaagacgt 

33811 aataacttcc cgtctgcaca tcaattttta gttagaaact ttggtactgg tggcgttggt aaatggagtt 

33881 tattcgaagg aaaggtggtt gaataatgat agtagataat ttttcgaaag acgataactt aatcgagtta 

33951 caaacaacat cacaatataa tccaattatt gacacaaaca tcagtttcta tgaatcagat agaggaactg 

34 021 gtgttttaaa ttttgcagta actaagaata acagaccgtt atctataagt tctgaacatg ttaaaacatc 

34091 tatcgtgtta aaaaccgatg attataacgt agatagaggc gcttatattt cagacgaatt aacgatagta 

34161 gacgcaatta atgggcgttt gcagtatgtg ataccgaatg aatttttaaa acattcaggc aaggtgcatg 

34231 ctcaggcatt ctttacacaa aacgggagta ataatgttgt tgttgaacgt caatttagct tcaatattga 

34301 aaatgattta gttagtgggt ttgatggtat aacaaagctt gtttatatca aatctattca agatactatc 

34371 gaagcagtcg gtaaagactt taaccaatta aagcaagata tggatgatac acaaacgtta atagcaaaag 

34441 tgaatgatag tgcgacaaaa ggcattcaac aaatcgaaat caagcaaaac gaagctatac aagctattac 

34511 tgcgacgcaa actagtgcaa cacaagctgt tacagctgaa gtcgataaaa tagttgaaaa agagcaagcg 

34581 atttttgaac gtgttaacga agttgaacaa caaatcaatg gcgctgacct tgttaaaggt aattcaacaa 

34651 caaattggca aaagtctaaa cttacagatg attacggtaa agcaattgaa tcgtatgagc agtccataga 

34721 tagcgtttta agcgcagtta acacatctag gattattcat attactaatg caacagatgc gccagaaaag 

34791 acggatatag gcacgttaga gaagcctgga caagatggtg ttgatgacgg ttcttcgttc gatgaatcaa 

34861 cttatacatc aagcaaatct ggtgtgttag ttgtttatgt tgttgataat aatactgctc gtgcaacatg 

34931 gtacccagac gattcaaacg atgagtacac aaaatacaaa atctacggca catggtaccc gttttataaa 

35001 aagaatgatg gaaacttaac taagcaattt gttgaagaaa cgtctaacaa cgctttaaat caagctaagc 

35071 agtatgtaga tgataaattc ggaacaacga gctggcaaca acataagatg acagaggcga atggtcaatc 

35141 aattcaagtt aacttaaata atgcgcaagg cgatttggga tatttaactg ctggtaatta ctatgcaaca 

35211 agagtgccgg atttaccagg tagtgttgaa agttatgagg gttatttatc ggtattcgtt aaagacgata 

35281 caaacaagct atttaacttc acgccttata actctaaaaa gatttacaca cgatcaatca caaacggcag 

35351 acttgagcaa cagtggacag ttcctaatga acataagtca acggtattgt tcgacggtgg agcaaatggt 

35421 gtaggtacaa caatcaatct aaccgaacca tacacaaact attctatttt attagtaagt ggaacttatc 

35491 caggtggcgt tattgaggga ttcggactaa ccacattacc taatgcaatt caattaagta aagcgaatgt 

35561 agttgactca gacggtaacg gtggcggtat ttatgagtgt ttactatcca aaacaagtag cactacttta 

35631 agaatcgata acgatgtgta ctttgattta ggtaaaacat caggttctgg agcgaatgcc aacaaagtta 

35701 ctataactaa aattatgggg tggaaataat gaaaatcaca gtaaatgata aaaatgaagt tatcggatac 

35771 gttaatactg gcggtttacg caatagttta gatgtagacg ataacaatgt gtctatcaaa ttcaaagaag 

35841 agttcgaacc tagaaagttc gttttcacta acggcgaaat taaatacaat agcaatttcg aaaaagaaga 

35911 cgtaccgaat gcatcaaacc aacaaagtgc gtcagattta agtgatgagg aacttcgcgg aatggttgca 

35981 agtatgcaaa tgcagatgac gcaagtgaac atgttgacaa tgcaattgac gcaacaaaac gctatgttaa 

36051 cacaacagtt gaccgaactg aaaactaaca aaacaaatac tgagggggac gtttaaatga tgaagatgat 

36121 ttatccaact tttaaagaca ttaaaacttt ttatgtgtgg ggttgctata aaaatgagca aattaagtgg 

36191 tacgtagaca tgggtgtaat cgacaaagaa gaatatgcat tgatcactgg tgaaaaatat ccagaggcaa 
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36261 aagatgaaaa gtcacaggtg taatgcttga ggctttttaa tttaacacaa agtaggtggc gtaatgtttg 

36331 gatttaccaa acggcacgaa catgaatggc gaattagaag attagaagag aatgataaaa caatgcttag 

36401 cactctcaat gagattaaat taggtcaaaa aactcaagag caagttaaca ttaaattaga taaaacttta 

36471 gatgctatcc agagggaaag acagatagac gaaaaaaata agaaagaaaa cgacaaaaat atacgcgata 

36541 tgaaaatgtg gattctcggt ttgataggga ctatcttcag tacgattgtc atagctttac taagaactat 

36611 ttttggtatt taaaggaggt gattaccatg cttaaaggga ttttaggata tagcttctgg gcgtgcttct 

36681 ggtttggtaa atgtaaataa cagttaagag tcagtgcttc ggcactggct ttttattttg attgaaatga 

36751 ggtgcataca tgggattacc taacccaaag actagaaagc ctacagctag tgaagtggtg gagtgggcaa 

36821 agtcgaatat tggtaagagg attaatatag ataattatcg gggcagtcaa tgttgggata cacctaactt 

36891 tatttttaaa agatattggg gttttgtaac atggggcaat gctaaggata tggctaatta cagatatcct 

36961 aagggtttcc gattctatcg ttattcatct ggatttgtac cggaacctgg agacatcgca gtttggcacc 

37031 ctggcaacgg aataggttcg gacggacaca ccgcaatagt agtaggacca tctaataaaa gttattttta 

37101 tagcgttgac caaaactggg ttaattctaa tagttggaca ggttctccag gaagattagt aagacaccct 

37171 tatgtaagtg ttacaggctt tgttaggcct ccatactcaa aagatactag caaacctagt agtactgata 

37241 caagttcagc atcaaaagcc aatgactcaa caattactgg cgaagcgaag aaaccgcaat ttaaagaagt 

37311 taaaacagta aaatacactg cttacagcaa tgttttagat aaagaagagc acttcattga tcatatagtt 

37381 gtaatgggtg atgaacgctc agatattcaa ggattatata taaaagaatc aatgcatatg cgttctgtag 

37451 acgaactgta tacgcaaaga aataagttta taagcgatta tgaaataccg catttatatg tcgatagaga 

37521 ggctacatgg cttgctagac caaccaattt tgatgacccg cgtcacccta attggctagt tattgaagta 

37591 tgtggtggtc aaacagatag caaacgacaa ttcttattga atcaaataca agcgttaata cgtggtgttt 

37661 ggttattgtc agggattgat aaaaacttat ctgaaacgac gttaaaggta gaccctaata tttggcgtag 

37731 tatgaaagat ttaattaatt acgacttgat taagcaaggt ataccggata acgcaaagta tgagcaagtt 

37801 aaaaagaaaa tgcttgagac atacattaaa cgagatatat tgacacgaga aaatataaaa gaagtaacga 

37871 caaaaacaac aataagaatt agtgataaaa catcagttga cagtgcgtcc acacgaggcc ctactccatc 

37941 agacgaaaaa ccaagcatcg ttactgaaac aagtccattc acattccagc aagcactgga tagacaaatg 

38011 tctaggggta acccgaaaaa atctcataca tggggctggg ctaatgcaac acgagcacaa acgagctcgg 

38081 caatgaatgt taagcgaata tgggaaagta acacgcaatg ctatcaaatg cttaatttag gcaagtatca 

38151 aggcatttca gttagtgcgc ttaacaaaat acttaaagga aaaggaacgc tcgacggaca aggcaaagca 

38221 ttcgcggaag cttgtaagaa aaacaacatt aacgaaattt atttgatcgc gcacgctttc ttagaaagtg 

38291 gatacggaac aagtaacttc gctagtggta gatacggtgc atataattac ttcggtattg gtgcattcga 

38361 caacgaccct gattatgcaa tgacgtttgc taaaaataaa ggttggacat ctccagcaaa agcaatcatg 

38431 ggcggtgcta gcttcgtaag aaaggattac atcaataaag gtcaaaacac attgtaccga attagatgga 

38501 atcctaagaa tccagctacc caccaatacg ctactgctat agagtggtgc caacatcaag caagtacaat 

38571 cgctaagtta tataaacaaa tcggcttaaa aggtatctac ttcacaaggg ataaatataa ataaagaggt 

38641 gtgtaaatgt acaaaataaa agatgttgaa acgagaataa aaaatgatgg tgttgactta ggtgacattg 

38711 gctgtcgatt ttacactgaa gatgaaaata cagcatctat aagaataggt atcaatgaca aacaaggtcg 

38781 tatcgatcta aaagcacatg gcttaacacc tagattacat ttgtttatgg aagatggctc tatattcaaa 

38851 aatgagcccc ttattatcga cgatgttgta aaagggttcc ttacctacaa aatacctaaa aaggttatca 

38921 aacacgctgg ttatgttcgc tgtaagctgt ttttagagaa agaagaagaa aaaatacatg tcgcaaactt 

38991 ttctttcaat atcgttgata gtggtattga atctgctgta gcaaaagaaa tcgatgttaa attggtagat 

39061 gatgctatta cgagaatttt aaaagataac gcgacagatt tattgagcaa agactttaaa gagaaaatag 

39131 ataaagatgt catttcttac atcgaaaaga atgaaagtag atttaaaggt gcgaaaggtg ataaaggcga 

39201 accgggacaa cctggtgcga aaggtgatac aggtaaaaaa ggagaacaag gcgcacccgg taaaaacggt 

39271 actgtagtat caatcaatcc tgacactaaa atgtggcaaa ttgatggtaa agatacagat atcaaagcag 

39341 aacctgagtt attggacaaa atcaatatcg caaatgttga agggttagaa gataaattgc aagaagttaa 

39411 aaaaatcaaa gatacaactc tcaacgactc taaaacgtat acggattcaa aaattgctga actagttgat 

39481 agcgcgcctg aatctatgaa tacattaaga gaattagcag aagcaataca aaacaactct atttcagaaa 

39551 gtgtattgca acagattggc tcaaaagtta gtacagaaga ttttgaggaa ttcaaacaaa cactaaacga 

39621 tttatatgct ccaaaaaatc ataatcatga tgagcggtat gttttgtcat ctcaagcttt tactaaacaa 

39691 caagcggata atttatatca actaaaaagc gcatctcaac cgacggttaa aatttggaca ggaacagaaa 

39761 atgaatataa ctatatatat caaaaagacc ctaatacact ttacttaatt aaggggtgat ttttatggaa 

3 9831 ggtaatttta aaaatgtaaa gaagtttatt tacgaaggtg aagaatatac aaaagtatat gctggaaata 
39901 tccaagtatg gaaaaagcct tcatcttttg taataaaacc cttacctaaa aataaatatc cggatagcat 
39971 agaagaatca acagcaaaat ggacaataaa tggagttgaa cctaataaaa gttatcaggt gacaatagaa 
40041 aatgtacgta gcggtataat gagggtttcg caaactaatt taggttcaag tgatttagga atatcaggag 
40111 tcaatagcgg agttgcaagt aaaaatatca actttagtaa tccttcaggg atgttgtatg tcactataag 
40181 tgatgtttat tcaggatctc caacattgac cattgaataa ttttaaacga ctaatttttt agtcgttttt 
40251 tattttggat aaaaggagca aacaaatgga tgcaaaagta ataacaagat acatcgtatt gatcttagca 
40321 ttagtaaatc aattcttagc gaacaaaggt attagcccga ttccagtaga cgatgagact atatcatcaa 
40391 taatacttac tgttgttgct ttatatacta cgtataaaga caatccaaca tctcaagaag gtaaatgggc 

4 0461 aaatcaaaag ctaaagaaat ataaagctga aaacaagtat agaaaagcaa cagggcaagc gccaattaaa 
40531 gaagtaatga cacctacgaa tatgaacgac acaaatgatt tagggtaggt gttgaccaat gttgataaca 
40601 aaaaaccaag cagaaaaatg gtttgataat tcattaggga agcagttcaa tcctgatttg ttttatggat 
40671 ttcagtgtta cgattacgca aatatgtttt ttatgatagc aacaggcgaa aggttacaag gtttatacgc 
40741 ttataatatt ccatttgata ataaagcaag gattgaaaaa tacgggcaaa taattaaaaa ctatgatagc 
40811 tttttaccgc aaaagttgga tattgtcgtt ttcccgtcaa agtatggtgg cggagctgga catgttgaaa 
40881 ttgttgagag cgcaaattta aacactttca catcatatgg gcaaaattgg aatggtaaag gttggacaaa 
40951 tggcgttgcg caacctggtt ggggtcctga aactgttaca agacatgttc attattacga tgacccaatg 
41021 tattttatta gattaaattt cccagataaa gtaagtgttg gagataaagc taaaagcgtt attaagcaag 
41091 caactgccaa aaagcaagca gtaattaaac ctaaaaaaat tatgcttgta gccggtcatg gttataacga 
41161 tcctggagca gtaggaaacg gaacaaacga acgcgatttt atccgtaaat atataacgcc aaatatcgct 
41231 aagtatttaa gacatgcagg tcatgaagtt gcattatatg gtggctcaag tcaatcacaa gacatgtatc 
41301 aagatactgc atacggtgtt aatgtaggaa ataataaaga ttatggatta tattgggtta aatcacaggg 
41371 gtatgacatt gttctagaga ttcatttaga cgcagcagga gaaaatgcaa gtggtgggca tgttattatc 
41441 tcaagtcaat tcaatgcgga t act at t gat aaaagtatac aagatgttat taaaaataac ttaggacaaa 
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41511 taagaggtgt aacacctcgt aatgatttac 

41581 tttatctgaa ttaggtttta ttactaataa 

41651 tctaaattaa tagctggtgc gattcatggt 

41721 cagctaaaaa ccaaaaaaat ccaccagtgc 

41791 aaaagagact ggtaattaca cagttgccaa 

41861 tcaagaatta caggtgtatt acctaataac 

41931 atagatggat tacttatatt gctaatagtg 

42001 aggtaatagg ataagtagtt ttggtaagtt 

42071 cattaattat agggaatctt acagttatta 

42141 ttaacattac tctcaagatt taaatgtaga 

42211 taatgtaatt acattaccag taaccaatct 

42281 gaggacttac ttgcgtaaag tagtaagaag 

42351 gttgtttttt atgttatatt ataaatgatc 

42421 tatgcaaaaa aaacgaaaaa aagttcataa 

42491 ataccagttg agaggaggat aaaaagtgtt 

42561 atgtcagcaa ttgccatagc gaaaacattg 

42631 tctatatata aattctaaca ctaaaatact 

42701 taaacgtgtt tttaggcaac gatataagta 

42771 tttatggaag agggataaaa atgacagcaa 

42841 agaaacggga tataaaattg ctaaaaattc 

42911 aaaacatctt tatcagatgc cagatttaga 

42981 acgaagaaga taaataaaag gagccaaaaa 

43051 aaagaagtat ttgaatcagg taaaaacttt 

43121 atgatagata cgtagtactt gaccataaaa 

43191 caaaagaaaa ttagtaagtt aaataattag 

43261 cgcgtgtcaa atacgtgtca atttagttct 

43331 cgcatagtta taggcttttc agctatatac 

43401 tggaaacctt gatttaatgg ggttttaatc 

434 71 cacgttgacc ttgctctttt ttatgttcat 

43541 ataatggcct aatcttttgc taatatattc 



tgaacgttaa tgtatcagca gaaataaata tcaattatcg 
aaaagatatg gattggatta agaagaatta tgacttgtat 
aagcctatag gtggtttggt agctggtaat gttaaaacat 
cagcaggtta tacacttgat aagaataatg tgccttataa 
tgttaaaggt aataacgtaa gggacggcta ttcaactaat 
gcaacaatca aatatgacgg cgcatattgc atcaatgggt 
gacaacgtcg ctatattgcg acaggagagg tagataaagc 
tagcacgatt tagtatttac ttagaataaa aattttgcta 
aataactatt tggatggatg ttaatattcc tatacacttt 
taacaggcag gtactacggt acttgcctat ttttttgtta 
ggcttaaaac cacatttccg gtagccaatc cggctatgca 
ctgactgcat atttaaacca cccatactag ttgctgggtg 
aaaccacacc acctattaat ttaggagtgt ggttattttt 
aaagtattgc atatcacgtt taaccgtgtt ataataaggt 
agaaaatttt aaaactatag cagaaatcgc cttttataca 
aaaaaagacg ataagtaagt agacaagccc gaaagggctg 
atgaaaacaa tttacattat tttaatcatt cttatttgga 
aaagtgttgt tgcactgctt actactttac tgcttatcaa 
taaaagaaat aattgaatca atagaaaagt tattcgaaaa 
cggattacca tatcaaactg tgcaagattt aagaaatgga 
acgataataa agttatacga gtatcaaaga tcgcttgaaa 
tatgtttgtt acaaaagaag aatttaaaac tttgaatgta 
ataaaaatta cagatggaag acatgcaata tattgggtaa 
aaggcgattt gtacccgcaa aaagcatacc caaaatatat 
aaaaccacgt cttaattgac gtggttattt tttaggtttg 
atttctttag ttttctttct aaacttaatt gcttgtaaac 
caagataaga tttatcccgc cgtctccata aaaatatgct 
tagcaagtgt caaatatgtg tcaagaaaat aattttctga 
caagtaagtg agagtaggtg tctaaagtta tagatatatt 
aatagg 
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Table 10 



Bacteriophage 96 ORFs list 



SID 


LAN 


FRA 


POS 


a. a. 


RBS sequence 


STA 


STO 


100733 


96ORF001 


1 


25999. .29142 


1047 


ccttgaat cgaaaggaggtt agcc t 


ttg 


taa 


100734 


96ORF002 


1 


32008. .33906 


632 


tttttacgactaaaggaggcaacca 


atg 


taa 


100735 


96ORF003 


1 


30109. .31995 


628 


ttatattttagataaggagtagcct 


atg 


taa 


100736 


96ORF004 


1 


36760. .38634 


624 


attttgattgaaatgaggtgcatac 


atg 


taa 


100737 


96ORF005 


3 


33903 . .35729 


608 


gtttattcgaaggaaaggtggttga 


ata 


taa 


100738 


96ORF006 


2 


40589. .42043 


484 


aatgatttagggtaggtgttgacca 


atg 


tag 


100739 


96ORF007 


1 


18652 . .20091 


479 


tatacacacatactaaacctgaacg 


att 


tga 


100740 


96ORF008 


2 


8960. .10201 


413 


tggcagaatttgggggcgataacga 


atg 


tga 


100741 


96ORF009 


2 


17447. .18670 


407 


gacgcaataacggaagtgatcgtca 


atg 


tga 


100742 


96ORF010 


1 


38647. .39819 


390 


t aaat at aaat aaagaggt gt gt aa 


atg 


tga 


100743 


96ORF011 


-1 


119. .1195 


358 


gtagctcgcctacccttattatttt 


ttg 


tga 


100744 


96ORF012 


2 


20045. .21013 


322 


tttaatgacaaattacctgacatag 


atg 


tga 


100745 


96ORF013 


3 


29157. .30098 


313 


acttattataagggaggtttgttag 


ttg 


taa 


100746 


96ORF014 


1 


21925. .22839 


304 


agaaaataaagtgaggtaataaaat 


atg 


tag 


100747 


96ORF015 


1 


5812. .6591 


259 


atacacggtaaaggtgggagaatag 


atg 


taa 


100748 


96ORF016 


1 


7852. .8607 


251 


aat aaaatgttgaaaggagagaaaa 


atg 


taa 


100749 


96ORF017 


3 


3444 . .4190 


248 


aaatttaacattaatatcactttaa 


gtg 


taa 


100750 


96ORF018 


-3 


28281. .29000 


239 


taagctatgttgaacatcgctagtc 


atg 


tga 


100751 


96ORF019 


3 


7188. .7859 


223 


tttaccgttctaggacgtggtttaa 


atg 


taa 


100752 


96ORF020 


3 


21324. .21908 


194 


gaagggc aaaaaggagt 1 1 1 gat at 


atg 


taa 


100753 


96ORF021 


3 


6612. .7175 


187 


at t aaaaat t aa 1 1 aaaaggacggt 


ata 


tag 


100754 


96ORF022 


2 


24536. .25093 


185 


aaagaaaaacgaaggagtgtattaa 


atg 


taa 


100755 


96ORF023 


1 


5275. .5811 


178 


catgaaatggtaggaggtatgaaaa 


gtg 


tag 


100756 


96ORF024 


3 


14481. .15014 


177 


t aaaacgat aggagat aacgaat aa 


atg 


taa 


100757 


96ORF025 


2 


25157. .25666 


169 


at aaaaaaat t gaaaagaggt a t at 


att 


taa 


100758 


96ORF026 


-3 


15084. .15590 


168 


tcattcttaacatagcccttaattc 


atg 


tga 


100759 


96ORF027 


-1 


1229. .1732 


167 


aatagcaaataaaggagtgtaaaac 


atg 


taa 


100760 


96ORF028 


1 


16960. .17454 


164 


aaggcgtgtgat acagtgaaaacaa 


ttg 


taa 


100761 


96ORF029 


-1 


1736. .2227 


163 


tatgagaaaaggagtcatataaaag 


atg 


taa 


100762 


96ORF030 


1 


25531. .25995 


154 


ttttcaagagggagagtcgctcgta 


ctg 


tag 


100763 


96ORF031 


2 


23633. .24097 


154 


1 1 1 agt at t gaaggt gat t c t gt ag 


ate 


tag 


100764 


96ORF032 


-2 


2248. .2706 


152 


ataagacaccaaaggggtttggcgc 


atg 


tga 


100765 


96ORF033 


-3 


39147. .39605 


152 


agcatataaatcgtttagtgtttgt 


ttg 


taa 


100766 


96ORF034 


2 


13181. .13615 


144 


tagaagtcgaaaaagtggaggcaat 


ata 


taa 


100767 


96ORF035 


2 


10628. .11053 


141 


gagctaggattgcaagcaacgatat 


ttg 


tga 


100768 


96ORF036 


2 


24110, .24535 


141 


gtatttttcatagaggtggttaaat 


atg 


taa 


100769 


96ORF037 


1 


12583. .12996 


137 


atgaggaacagaagcaaccaacttt 


att 


tga 


100770 


96ORF038 


1 


15628. .16032 


134 


atgttaagaatgatgcctagtttaa 


ttg 


taa 


100771 


96ORF039 


3 


39816. .40220 


134 


ctaatacactttacttaattaaggg 


gtg 


taa 


100772 


96ORF040 


-3 


27528. .27932 


134 


tttccataaataaacgaggacacca 


atg 


tga 


100773 


96ORF041 


3 


16206. .16607 


133 


gat gagggcggaggt gt cagagt ag 


atg 


tga 


100774 


96ORF042 


2 


35720. .36106 


128 


aagttactataactaaaattatggg 


gtg 


taa 


100775 


96ORF043 


-2 


35713. .36081 


122 


ttaaacgtccccctcagtatttgtt 


ttg 


taa 


100776 


96ORF044 


-2 


9460. .9828 


122 


agtatccatcagttgaagataatct 


ata 


taa 


100777 


96ORF045 


-3 


5139. .5504 


121 


ttctttttgtattctgtaatattca 


att 


tga 


100778 


96ORF046 


2 


11513. .11872 


119 


aagt aaat gt at agaggt ggaat aa 


atg 


taa 


100779 


96ORF047 


2 


22991. .23350 


119 


gtcgtactacgtctgataagagcga 


gtg 


tag 


100780 


96ORF048 


3 


8607. .8963 


118 


tggaaaaagaattgagtgatgacta 


atg 


tga 


100781 


96ORF049 


1 


23353. .23697 


114 


atccgtttaaaccaataaggtagag 


gtg 


taa 


100782 


96ORF050 


-2 


2728. .3072 


114 


tggtaaattagtattacattaagta 


ata 


taa 


100783 


96ORF051 


3 


4692. .5021 


109 


tcaaaatatacggaggtagtcaact 


atg 


tga 


100784 


96ORF052 


-1 


20882. .21211 


109 


gtagcaaagagacaactaaaaaagt 


gtg 


taa 


100785 


96ORF053 


1 


40252. ,40578 


108 


acgactaattttttagtcgtttttt 


att 


tag 


100786 


96ORF054 


1 


4942. .5262 


106 


aatat aaaact aaaaaacaaaat 1 1 


atg 


tag 


100787 


96ORF055 


-2 


4840. .5151 


103 


ccgtcgcaatatatagttcgcttaa 


ate 


taa 


100788 


96ORF056 


3 


36324. .36623 


99 


aatttaacacaaagtaggtggcgta 


atg 


taa 


100789 


96ORF057 


2 


1394. .1690 


98 


cttcagtggctcttttagcatttaa 


ata 


taa 


100790 


96ORF058 


-3 


26247. .26537 


96 


tacttcttttctcataatctgacca 


att 


tga 


100791 


96ORF059 


-1 


21485 . .21772 


95 


agactcaacgcctttttgaacatac 


ttg 


tga 


100792 


96ORF060 


-3 


22647 . .22931 


94 


cctctttgtaaccgacaagactgta 


ata 


taa 


100793 


96ORF061 


1 


14023 . .14304 


93 


ttatctaattaagggggacgagtga 


gtg 


taa 
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100794 


96ORF062 


-2 


38281 . .38559 


92 


tatataacttagcgattgtacttgc 


ttg 


taa 


100795 


96ORF063 


-3 


30786 . .31064 


92 


gtctcctaatactacatcttgctta 


gtg 


tga 


100796 


96ORF064 


-2 


30205. .30480 


91 


atgcatctacttttggatgtaatac 


at a 


tag 


100797 


96ORF065 


1 


2617. .2886 


89 


aaggtctaataaaaatttctccttc 


ttg 


taa 


100798 


96ORF066 


3 


28056. .28325 


89 


aaggtgtagtcggctggttaactga 


att 


taa 


100799 


96ORF067 


-3 


17142. .17411 


89 


ttccgttattgcgtcgtgaagttgt 


ttg 


tga 


100800 


96ORF068 


2 


12326. .12589 


87 


aatgcatgtcgtttggtctgcctaa 


ttg 


tag 


100801 


96ORF069 


2 


42734. .42997 


87 


tttttaggcaacgatataagtaaaa 


gtg 


taa 


100802 


96ORF070 


1 


11869. .12129 


86 


aaatgttcaagaaatggagtgaagc 


at a 


taa 


100803 


96ORF071 


3 


15396. .15656 


86 


aacaagctatacaaattatcgataa 


att 


taa 


100804 


96ORF072 


-3 


37749. .38009 


86 


agattttttcgggttacccctagac 


att 


taa 


100805 


96ORF073 


3 


11244 . . 11501 


85 


acatgcatatatagaggtggaataa 


atg 


tag 


100806 


96ORF074 


-3 


42936 . .43193 


85 


aattatttaacttactaattttctt 


ttg 


taa 


100807 


96ORF075 


-3 


26610. .26867 


85 


tactgccaatgttccatcttcaacc 


att 


taa 


100808 


96ORF076 


-1 


11126. .11380 


84 


tttatctaatacatttaagttaacc 


ate 


taa 


100809 


96ORF077 


-2 


16537, .16791 


84 


tacccaccatataggcaggtagtag 


gtg 


tag 


100810 


96ORF078 


-3 


19521. .19775 


84 


aataactttgaattgatacctcaac 


at a 


tga 


100811 


96ORF079 


3 


13608. .13859 


83 


ttagggcaaatggaggcagacacaa 


atg 


tag 


100812 


96ORF080 


-3 


28029. .28280 


83 


tgagaagtcgccagtaagcaactga 


att 


tga 


100813 


96ORF081 


3 


20973. .21221 


82 


aatgaagttatcccattcatgactt 


ate 


tag 


100814 


96ORF082 


-1 


8729. .8974 


81 


cgattattgtgctttcaatttcaaa 


ttg 


tga 


100815 


96ORF083 


-3 


3147. .3392 


81 


tttagcctttatataatcaacttct 


gtg 


tga 


100816 


96ORF084 


3 


1611. .1853 


80 


tgctttatctttagtttctttcttt 


ttg 


tga 


100817 


96ORF085 


-2 


29470. .29709 


79 


ctcttatcaccttcgtttgtaggca 


ate 


taa 


100818 


96ORF086 


1 


35188 . .35424 


78 


gcgcaaggcgat t tgggat at 1 1 aa 


ctg 


tag 


100819 


96ORF087 


-2 


13039. .13275 


78 


ttttgattgagctctaaagtgtctt 


att 


tag 


100820 


96ORF088 


3 


24930. .25163 


77 


gaactatcattaaaagttaaatgga 


at a 


tga 


100821 


96ORF089 


-3 


22329. .22562 


77 


t ccagt ataagat agtggt aat ccc 


ata 


taa 


100822 


96ORF090 


-3 


16803. .17036 


77 


acctttagtcgaataccctgcgtca 


at a 


tag 


100823 


96ORF091 


-1 


22559. .22789 


76 


aacgcttctggtttaacgttcatgt 


atg 


taa 


100824 


96ORF092 


3 


18360. .18587 


75 


attgcaaaagatattgtaagtagat 


atg 


taa 


100825 


96ORF093 


-2 


25384. .25608 


74 


catgatttccttgtaattctctttc 


ate 


taa 


100826 


96ORF094 


1 


10417. .10638 


73 


aacacacat t aaggagtgt t aaaaa 


atg 


tag 


100827 


96ORF095 


3 


12963. .13184 


73 


tactaaacgaagataaaactatgac 


att 


taa 


100828 


96ORF096 


1 


42994. .43212 


72 


gatcgcttgaaaacgaagaagataa 


ata 


taa 


100829 


96ORF097 


-1 


36047. .36265 


72 


tcaagcattacacctgtgacttttc 


ate 


taa 


100830 


96ORF098 


-2 


36766. .36984 


72 


caggttccggtacaaatccagatga 


ata 


taa 


100831 


96ORF099 


-2 


34765. .34983 


72 


tcattctttttataaaacgggtacc 


atg 


tag 


100832 


96ORF100 


1 


10198. .10413 


71 


acaagaagactcagaggtttttcac 


atg 


taa 


100833 


96ORF101 


1 


15208. .15423 


71 


gagaaacaagttaagataaggagag 


atg 


tga 


100834 


96ORF102 


3 


4209. .4424 


71 


attttaaaacgaaatataggagagg 


ctg 


tag 


100835 


96ORF103 


3 


11673. .11888 


71 


catgcaccttatggtatgcgcttag 


ctg 


taa 


100836 


96ORF104 


3 


12117. .12332 


71 


tttacgtccaaagagcttttgactt 


gtg 


taa 


100837 


96ORF105 


3 


23892. .24107 


71 


gatggtgggttatccagtgttataa 


gtg 


taa 


100838 


96ORF106 


-3 


34428 . .34643 


71 


tagacttttgccaatttgttgttga 


att 


taa 


100839 


96ORF107 


-3 


24495. .24710 


71 


ggcacattaccaattgttaatttaa 


atg 


taa 


100840 


96ORF108 


-1 


23876. .24088 


70 


acatatttaaccacctctatgaaaa 


ata 


taa 


100841 


96ORF109 


-2 


17317. .17529 


70 


acctgtacgctttgctccgtgatta 


att 


taa 


100842 


96ORF110 


-3 


38931. .39143 


70 


actttcattcttttcgatgtaagaa 


atg 


taa 


100843 


960RF111 


-3 


21855. .22067 


70 


agtaaattttttcttttgtgctgtc 


att 


tga 


100844 


960RF112 


1 


3217. .3426 


69 


aaatgtcaacgggaggtgatacgaa 


atg 


taa 


100845 


960RF113 


-1 


25469. .25678 


69 


tcagggatatatcctaaatatctag 


ctg 


taa 


100846 


960RF114 


-2 


9838. .10047 


69 


ataataatcatcacggtaaagtagc 


ate 


tga 


100847 


960RF115 


1 


13819. .14022 


67 


gcagtaggggttatggcaggtcaag 


ttg 


tga 


100848 


960RF116 


-1 


41033. .41236 


67 


caacttcatgacctgcatgtcttaa 


ata 


taa 


100849 


960RF117 


-3 


24711. .24914 


67 


tctgctgtattccatttaactttta 


atg 


taa 


100850 


960RF118 


-1 


12374. .12574 


66 


tccatctcctctaaaataaagttgg 


ttg 


taa 


100851 


960RF119 


-1 


3980. .4180 


66 


ctcctatatttcgttttaaaatttc 


att 


tga 


100852 


96ORF120 


-3 


6033. .6233 


66 


ttgtaatttagaaatataacgataa 


ata 


taa 


100853 


960RF121 


-2 


37939. .38136 


65 


ctgaaatgccttgatacttgcctaa 


att 


tga 


100854 


960RF122 


2 


37892 . .38086 


64 


acgacaaaaacaacaataagaatta 


gtg 


tga 


100855 


960RF123 


-3 


29193 . .29387 


64 


ggacgtctgactttaaatgtgaagc 


ata 


tga 


100856 


960RF124 


1 


4408. .4599 


63 


tttatcggtaccaatttaatgatta 


atg 


taa 


100857 


960RF125 


-1 


7787. .7978 


63 


ttaaaaatccaagttttgccatcgt 


att 


tga 


100858 


960RF126 


-3 


27027 . .27218 


63 


aaatttgaacaacggcattaattga 


gtg 


tga 


100859 


960RF127 


3 


15051. .15239 


62 


atcgagtcaaggaggttttggggaa 


gtg 


tga 


100860 


960RF128 


-1 


6914, .7102 


62 


agcgaatgggtttgattgttgactc 


ata 


tga 


100861 


960RF129 


-3 


31332. .31520 


62 


tcttatttgctctgcttgtctataa 


atg 


tga 


100862 


96ORF130 


-3 


30084 . .30272 


62 


gaaatcatcttcaccttcaacatga 


gtg 


taa 


100863 


960RF131 


3 


11058. .11243 


61 


agaaaaagagaaatgaagtgatcta 


atg 


taa 


100864 


960RF132 


-1 


36434 . .36619 


61 


taagcatggtaatcacctcctttaa 


ata 


tga 



SD-63311.4 



157 



248/037 



100865 


960RF133 


-1 


35591. .35776 


61 


ctaaactattgcgtaaaccgccagt 


att 


taa 


100866 


960RF134 


-2 


9250. .9435 


61 


atccatgagcttataacccgtctta 


att 


tga 


100867 


960RF135 


1 


29563. .29745 


60 


cgacaac 1 1 1 1 tgt aggac t agt aa 


gtg 


tga 


100868 


960RF136 


-3 


12486. .12668 


60 


cactttactttcaacttgttcagga 


ttg 


taa 


100869 


960RF13 7 


-1 


14501. .14680 


59 


caaactgaaagctaagtaatcagca 


ate 


tga 


100870 


960RF138 


-2 


23326. .23505 


59 


cttgtgacatttgatgaaattttag 


ttg 


tga 


100871 


960RF139 


-3 


42672. .42851 


59 


aatccggaatttttagcaattttat 


ate 


taa 


100872 


96ORF140 


-3 


31137. .31316 


59 


acttgattgactagtaaagtcgtac 


atg 


taa 


100873 


960RF141 


-3 


18969. .19148 


59 


aacaaaaataacattatagggatct 


ata 


taa 


100874 


960RF142 


-3 


4740. .4919 


59 


cataaattttgttttttagttttat 


att 


tga 


100875 


960RF143 


2 


36107. .36283 


58 


aacaaatactgagggggacgtttaa 


atg 


taa 


100876 


960RF144 


3 


16029. .16205 


58 


tatacgaagtaaagaaggtagataa 


ata 


tag 


100877 


960RF145 


-3 


29013. .29189 


58 


tgtcactgacgcgatactgtgaacc 


att 


tga 


100878 


960RF146 


-3 


14883. .15059 


58 


aatctttgaatgttgtgactaagta 


ttg 


taa 


100879 


960RF147 


-1 


18251. .18424 


57 


tatcagcgttaattgcacgtaatct 


atg 


taa 


100880 


960RF148 


-1 


13583. .13756 


57 


aataccttctttaactgaatgttga 


ata 


taa 


100881 


960RF149 


-2 


10756 . .10929 


57 


taaattcacatctctatactgatat 


ctg 


tag 


100882 


96ORF150 


2 


14171. .14341 


56 


atttttaatgaagaagtgttattaa 


ctg 


tag 


100883 


960RF151 


2 


19217. .19387 


56 


cctacatactcattgcgctactttt 


atg 


tga 


100884 


960RF152 


-1 


12614 . .12784 


56 


atttctacagtaaaaatatctttat 


ctg 


taa 


100885 


960RF153 


-2 


11836. .12006 


56 


ttgcattacctattgcgaatgctag 


ttg 


taa 


100886 


960RF154 


-2 


4165. .4335 


56 


atataacgcttttgtcctcgaccaa 


ate 


tga 


100887 


960RF155 


-3 


40464. .40634 


56 


aaatcaggattgaactgcttcccta 


atg 


tga 


100888 


960RF156 


3 


423. .590 


55 


tggtaattttgataatttagcttta 


ata 


taa 


100889 


960RF157 


-1 


41879. .42046 


55 


gtagcaaaatttttattctaagtaa 


ata 


taa 


100890 


960RF158 


-2 


36166. .36333 


55 


cattcatgttcgtgccgtttggtaa 


ate 


tag 


100891 


960RF159 


-2 


16228. .16395 


55 


tttaacatctgagcataccttttat 


ttg 


taa 


100892 


96ORF160 


3 


1038. .1202 


54 


atctctaagcagttgttgagcagcg 


ttg 


taa 


100893 


960RF161 


-1 


19193. .19357 


54 


tctttgttgttaggtacaccaaaca 


atg 


tag 


100894 


960RF162 


-1 


18074. .18238 


54 


ctcgtcctattaacacaatagatcc 


ata 


tga 


100895 


960RF163 


-1 


15386. .15550 


54 


agccatcataggactgtaaaattca 


ctg 


taa 


100896 


960RF164 


-1 


10049. .10213 


54 


tacatcgatttcaataagcttttga 


att 


tag 


100897 


960RF165 


-2 


18514. .18678 


54 


gtgcttcaatatcatctattaactt 


ata 


taa 


100898 


960RF166 


-2 


11104. .11268 


54 


ctagccatgattacccttaaattag 


ttg 


tag 


100899 


960RF167 


-3 


13764. .13928 


54 


agacagtttataatgtgtatctcta 


ata 


tga 


100900 


960RF168 


1 


14305. .14466 


53 


ttttgaatttttggaggacgagtaa 


atg 


tag 


100901 


960RF169 


-1 


17885. .18046 


53 


gtgttgaagccttaatagactcttt 


ata 


tga 


100902 


96ORF170 


-1 


10790. .10951 


53 


taggcgctttacatatccacgttaa 


att 


taa 


100903 


960RF171 


-3 


12765. .12926 


53 


atcttcgtttagtatataaaacgct 


ctg 


taa 


100904 


960RF172 


3 


22836. .22994 


52 


cgttcgcaacgcttaaaccaactga 


ata 


tga 


100905 


960RF173 


-1 


15956. .16114 


52 


ctctacatcatcattagccgtcgtc 


ata 


taa 


100906 


960RF174 


-1 


10571. .10729 


52 


tagtgccattcatattactttctaa 


ata 


taa 


100907 


960RF175 


-1 


3440. .3598 


52 


cagcctatcttcactatcaacatga 


ttg 


taa 


100908 


960RF176 


-3 


37170. .37328 


52 


tttatctaaaacattgctgtaagca 


gtg 


taa 


100909 


960RF177 


-3 


6693. .6851 


52 


ttcctaatctactaagtaactcgat 


ata 


taa 


100910 


960RF178 


-3 


5655. .5813 


52 


gacatcttgattagttttttcagtc 


ate 


tag 


100911 


960RF179 


1 


34564 . .34719 


51 


gttacagctgaagtcgataaaatag 


ttg 


tag 


100912 


96ORF180 


1 


42661. .42816 


51 


atataaattctaacactaaaatact 


atg 


tga 


100913 


960RF181 


-2 


37741. .37896 


51 


tggacgcactgtcaactgatgtttt 


ate 


taa 


100914 


960RF182 


-2 


25039. .25194 


51 


ttcgtaatctttttctccgtcatta 


att 


tga 


100915 


960RF183 


-2 


4534 . .4689 


51 


tcagttttaatattttcagccatag 


ttg 


tga 


100916 


960RF184 


1 


6721. .6873 


50 


ggagctggagaatttacagtaaaag 


ttg 


tag 


100917 


960RF185 


2 


36548. .36700 


50 


acaaaaatatacgcgatatgaaaat 


gtg 


taa 


100918 


960RF186 


-1 


40025. .40177 


50 


tggagatcctgaataaacatcactt 


ata 


tga 


100919 


960RF187 


-1 


34466. .34618 


50 


attacctttaacaaggtcagcgcca 


ttg 


tga 


100920 


960RF188 


-1 


33842. .33994 


50 


agttcctctatctgattcatagaaa 


ctg 


taa 


100921 


960RF189 


-1 


24914 . .25066 


50 


acatagaatggtcttccgtgtgtga 


ate 


taa 


100922 


96ORF190 


-2 


20395. .20547 


50 


tatcttagagtaaccctctccactc 


ata 


tga 


100923 


960RF191 


3 


24768. .24917 


49 


aaaggaattgaagcagtgaaacacg 


ctg 


taa 


100924 


960RF192 


-1 


16169. .16318 


49 


ttgtggtttcggcaacgttgcttgt 


atg 


tga 


100925 


960RF193 


-2 


39100. .39249 


49 


cagtaccgtttttaccgggtgcgcc 


ttg 


taa 


100926 


960RF194 


-2 


25921. .26070 


49 


ttggtacagacgtctttgctaatcg 


ttg 


taa 


100927 


960RF195 


-2 


17779. .17928 


49 


caaccaatgctcgggatggtcaggg 


ttg 


tga 


100928 


960RF196 


-2 


14182. .14331 


49 


ttaaatacttttcttctagcaatgc 


ate 


tga 


100929 


960RF197 


-2 


7609, .7758 


49 


ttatcatcaaacgacttaacaccaa 


ttg 


tga 


100930 


960RF198 


-2 


1537. .1686 


49 


ttattagctagtgcgttagtgttag 


gtg 


taa 


100931 


960RF199 


-3 


7719. .7868 


49 


taatacttgtatcggatagtcatct 


att 


taa 


100932 


96ORF200 


2 


22271. .22417 


48 


ttctttaatgaggttaaacctctaa 


ttg 


tag 


100933 


96ORF201 


2 


30353. .30499 


48 


tctactattggcgaaaaaataaggc 


ttg 


tag 


100934 


96ORF202 


2 


32591. .32737 


48 


agattgaagcccaacggacaattta 


ttg 


taa 


100935 


96ORF203 


2 


39131. .39277 


48 


agcaaagactttaaagagaaaatag 


ata 


tag 
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100936 


96ORF204 


-2 


36985 . .37131 


48 


atcttcctggagaacctgtccaact 


att 


tga 


100937 


96ORF205 


-3 


38721 . .38867 


48 


aaggaacccttttacaacatcgtcg 


at a 


taa 


100938 


96ORF206 


-3 


35880 . .36026 


48 


gttaacatagcgttttgttgcgtca 


att 


taa 


100939 


96ORF207 


-3 


11550 . . 11696 


48 


ttgctctctcgctccatgattttgg 


at a 


taa 


100940 


96ORF208 


2 


37178 . .37321 


47 


agattagtaagacacccttatgtaa 


gtg 


taa 


100941 


96ORF209 


2 


42341 . .42484 


47 


tgcatatttaaaccacccatactag 


ttg 


taa 


100942 


96ORF210 


3 


41850 . .41993 


47 


aaaggtaataacgtaagggacggct 


att 


tag 


100943 


960RF211 


-1 


6662 . .6805 


47 


ttgttggaatggtgggacgaattgg 


ttg 


tga 


100944 


960RF212 


-2 


25213. .25356 


47 


agtagcacattcccaaaattgtaaa 


ate 


taa 


100945 


960RF213 


-3 


42219. .42362 


47 


gtggtttgatcatttataatataac 


ata 


taa 


100946 


960RF214 


3 


27834 . .27974 


46 


aaaagat tt t agactt cgt t agaac 


ate 


tag 


100947 


960RF215 


3 


35811. .35951 


46 


ttacgcaatagtttagatgtagacg 


ata 


taa 


100948 


960RF216 


-1 


5402 . .5542 


46 


tttccgtaaggtgtattcaacttga 


att 


tga 


100949 


960RF217 


-2 


24229 . .24369 


46 


tataggtctgttaagcacataacct 


ate 


taa 


100950 


960RF218 


-2 


6253 . .6393 


46 


ttgtcattcttgctaacacgtcaga 


ttg 


taa 


100951 


960RF219 


1 


883 . . 1020 


45 


aaatcactcccgaaatattcgttaa 


ata 


taa 


100952 


96ORF220 


2 


32936 . .33073 


45 


gataaaggtatagacaaagtattgt 


ate 


taa 


100953 


960RF221 


3 


41703 . .41840 


45 


ggtaagcctataggtggtttggtag 


ctg 


taa 


100954 


960RF222 


-1 


39860 . .39997 


45 


acttttattaggttcaactccattt 


att 


taa 


100955 


960RF223 


-1 


24716 . .24853 


45 


acatttcaaatgattctggaacaac 


ata 


taa 


100956 


960RF224 


-2 


26794. .26931 


45 


caatatcacgccatgtagtttttaa 


ctg 


taa 


100957 


960RF225 


-2 


19201. .19338 


45 


caaacaatggattgtaatcaaataa 


atg 


tga 


100958 


960RF226 


-2 


15709. .15846 


45 


tgacttgcttgttgtctaacacaat 


ata 


taa 


100959 


960RF227 


-3 


36711 . .36848 


45 


acattgactgccccgataattatct 


ata 


tga 


100960 


960RF228 


3 


2325 . .2459 


44 


tcgccatagtgagttccaataccgt 


ata 


taa 


100961 


960RF229 


-1 


38612. .38746 


44 


ttgtcattgatacctattcttatag 


atg 


tga 


100962 


96ORF230 


-1 


31733 . .31867 


44 


gctggattgtatggcttaaagtaat 


ctg 


tag 


100963 


960RF231 


-2 


12076. .12210 


44 


tgactcatagctttaacttgttcgt 


ctg 


taa 


100964 


960RF232 


-3 


31644. .31778 


44 


atagtcctcaagtgttaaccctagt 


ttg 


taa 


100965 


960RF233 


-3 


23988. .24122 


44 


atttgatttgtaagttcaggctcaa 


ctg 


taa 


100966 


960RF234 


-3 


17529. .17663 


44 


agtacgtttttttgaatcgtaccta 


atg 


taa 


100967 


960RF235 


1 


7153, .7284 


43 


aatgct aatggt c caat agaaat ca 


atg 


tag 


100968 


960RF236 


2 


2681. .2812 


43 


ttctttcacttcaacttcacatttc 


ata 


tga 


100969 


960RF237 


2 


4496, .4627 


43 


gtactatgcttcacagtcttagcga 


ttg 


taa 


100970 


960RF238 


-1 


41720. .41851 


43 


cacctgtaattcttgaattagttga 


ata 


tga 


100971 


960RF239 


-1 


35324. .35455 


43 


acttactaataaaatagaatagttt 


gtg 


taa 


100972 


96ORF240 


-1 


8570. .8701 


43 


atccccgttttgacttaatacatca 


ate 


tga 


100973 


960RF241 


-2 


33502. .33633 


43 


ataattttgtaatactcttagggat 


atg 


tag 


100974 


960RF242 


-2 


23662. .23793 


43 


agct aatgct acagcagtgttgtaa 


ate 


tag 


100975 


960RF243 


-3 


32391. .32522 


43 


acctggacgagcttgcgtcatataa 


ata 


tag 


100976 


960RF244 


-3 


30273 . .30404 


43 


aaaactttcgttatactcttggtaa 


ate 


tga 


100977 


960RF245 


-3 


5895 . .6026 


43 


tgcactaaaatgcttataattctta 


ate 


taa 


100978 


960RF246 


-3 


2679. .2810 


43 


attcatcaagaaactatagccggtc 


atg 


tga 


100979 


960RF247 


1 


34891. .35019 


42 


acatcaagcaaatctggtgtgttag 


ttg 


taa 


100980 


960RF248 


2 


30668. .30796 


42 


aattattacattaaagctggtgtga 


atg 


tag 


100981 


960RF249 


2 


31838 . .31966 


42 


caaatattagcttgtagtgagttag 


atg 


taa 


100982 


96ORF250 


2 


33539. .33667 


42 


c 1 1 accagaaacagc acaggt agaa 


ata 


taa 


100983 


960RF251 


-1 


20486. .20614 


42 


cttctgtacgagccacacgcaatga 


ttg 


tag 


100984 


960RF252 


-1 


15128. .15256 


42 


gatatttcattactagctactacta 


ata 


tga 


100985 


960RF253 


-2 


41446. .41574 


42 


aaaacctaattcagataaacgataa 


ttg 


tga 


100986 


960RF254 


-2 


41005. .41133 


42 


gt t at aac catgaccggc t acaagc 


ata 


taa 


100987 


960RF255 


-2 


23008. .23136 


42 


aggataaatgacttgaccatctttc 


ata 


taa 


100988 


960RF256 


-2 


14794 . .14922 


42 


ttgtatgcgtcaatgagttggtcga 


ttg 


tag 


100989 


960RF257 


-2 


8503. .8631 


42 


tacctaacttttttaataatttcta 


atg 


tga 


100990 


960RF258 


-3 


22143 . .22271 


42 


aaacgctttgtaaaatgcctctgca 


att 


tga 


100991 


960RF259 


-3 


18639. .18767 


42 


cttgtatctattatagagattaacc 


att 


tag 


100992 


96ORF260 


-3 


15624 . .15752 


42 


gttttggtaactagccactgtatag 


ata 


taa 


100993 


960RF261 


2 


18746 . .18871 


41 


catattgaggctctaatagagtcac 


ata 


taa 


100994 


960RF262 


-1 


13067 . .13192 


41 


aattaattaattcttctcttgttgg 


ttg 


taa 


100995 


960RF263 


-2 


18742 . .18867 


41 


taacagacacgtctaatcgccttac 


att 


tga 


100996 


960RF264 


-2 


18376. .18501 


41 


catattatcataaagaacaagtaac 


ttg 


taa 


100997 


960RF265 


-2 


367. .492 


41 


ctaaacgaaaaagagggtacaatac 


ate 


tga 


100998 


960RF266 


-3 


32802. .32927 


41 


aggtatatccatttgatacaatact 


ttg 


taa 


100999 


960RF267 


-3 


10194. .10319 


41 


at cat cgaaaggcgataact cgt t a 


ttg 


tga 


101000 


960RF268 


1 


1159. .1281 


40 


ttattcttcctttttgtaattgtaa 


atg 


taa 


101001 


960RF269 


2 


10373. .10495 


40 


gacagagttgaaaagaaaatcatga 


atg 


taa 


101002 


96ORF270 


2 


15734. .15856 


40 


ttattcggcgtaatcgcactgatgc 


ttg 


tag 


101003 


960RF271 


-1 


43451. .43573 


40 


c c tNo shine -dalgarno 
sequence 


att 


tga 


101004 


960RF272 


-1 


36959. .37081 


40 


acgctataaaaataacttttattag 


atg 


tag 


101005 


960RF273 


-1 


35798. .35920 


40 


ctgacgcactttgttggtttgatgc 


att 


taa 
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101006 


960RF274 


-1 


8147. .8269 


40 


tctgtctctctatgtttgttagtct 


ctg 


tga 


101007 


960RF275 


-2 


43066. .43188 


40 


tttaacttactaattttcttttgat 


at a 


tga 


101008 


960RF276 


-2 


42535. .42657 


40 


aaat aatgt aaat tgt 1 1 1 c at agt 


att 


tag 


101009 


960RF277 


-2 


30628. .30750 


40 


tttgtagtcccgcttctgcaaaagt 


ctg 


taa 


101010 


960RF278 


-2 


13291. .13413 


40 


ttcgtatcttccaagcaattcattt 


ttg 


tga 


101011 


960RF279 


-2 


3172. .3294 


40 


cagattgtttagtaacgcctaattt 


ate 


taa 


101012 


96ORF280 


-3 


18804 . .18926 


40 


taaataaccaacacgtgtatcaaca 


att 


tag 


101013 


960RF281 


-3 


15843 . .15965 


40 


atttaaaaagtgtattctataacca 


ate 


tag 


101014 


960RF282 


-3 


8460. .8582 


40 


ttagtcatcactcaattctttttcc 


att 


taa 


101015 


960RF283 


-3 


7593. .7715 


40 


gatgttgtctacacagtgctaacac 


atg 


taa 


101016 


960RF284 


-3 


6453. .6575 


40 


aattaatttttaattaccatttcta 


att 


tga 


101017 


960RF285 


1 


15082 . .15201 


39 


caatacttagtcacaacattcaaag 


att 


taa 


101018 


960RF286 


1 


34444 . .34563 


39 


acacaaacgttaatagcaaaagtga 


atg 


tag 


101019 


960RF287 


2 


27920 . .28039 


39 


cctattttagcagttgttgcagtaa 


ttg 


tag 


101020 


960RF288 


2 


28415 . .28534 


39 


atcggctttttaactggcgtaatga 


ate 


tag 


101021 


960RF289 


2 


38147. .38266 


39 


tatcaaatgcttaatttaggcaagt 


ate 


tga 


101022 


96ORF290 


3 


40917. .41036 


39 


gcaaatttaaacactttcacatcat 


atg 


taa 


101023 


960RF291 


-2 


38815. .38934 


39 


tctctaaaaacagcttacagcgaac 


at a 


taa 


101024 


960RF292 


-2 


32671. .32790 


39 


ctataggat tat aaat cgctgacgt 


at a 


tga 


101025 


960RF293 


-2 


31216. .31335 


39 


ttgatttgatgtttcttatacttga 


ttg 


taa 


101026 


960RF294 


-2 


21589. .21708 


39 


gtatcttcatcagaatcgcctaaaa 


ate 


taa 


101027 


960RF295 


-2 


18976. .19095 


39 


tatcaatatatgctaacctagcacc 


at a 


taa 


101028 


960RF296 


-2 


11482. .11601 


39 


gccacctcgtactctttttgcaacc 


att 


taa 


101029 


960RF297 


-3 


12933. .13052 


39 


tcacgaaataatgtttctttaattt 


at a 


taa 


101030 


960RF298 


-3 


8262. .8381 


39 


gaactgatcttgcttaaatgattta 


att 


tag 


101031 


960RF299 


-3 


6993. .7112 


39 


cattagcattagcgaatgggtttga 


ttg 


tga 


101032 


96ORF300 


2 


23516. .23632 


38 


actacatctgaacaactaaaatttc 


ate 


tag 


101033 


96ORF301 


2 


25943. .26059 


38 


agattagaagaagaaaaaagaagac 


gtg 


taa 


101034 


96ORF302 


2 


36929, .37045 


38 


tattggggttttgtaacatggggca 


atg 


tag 


101035 


96ORF303 


3 


4476. .4592 


38 


ataaaagctacctagtagcagtact 


atg 


tga 


101036 


96ORF304 


3 


20586. .20702 


38 


tactctaagatagctaaagcaatac 


gtg 


tga 


101037 


96ORF305 


3 


28356. .28472 


38 


cggttaccaatgtgcttgatacgat 


ttg 


taa 


101038 


96ORF306 


-1 


24359. .24475 


38 


acttaaataaaagccgtatcgtgcc 


atg 


taa 


101039 


96ORF307 


-1 


20147. .20263 


38 


ttgtacctatacgagttaactcctt 


att 


tag 


101040 


96ORF308 


-2 


38158. .38274 


38 


ttccgtatccactttctaagaaagc 


gtg 


tga 


101041 


96ORF309 


-2 


35149. .35265 


38 


agcttgtttgtatcgtctttaacga 


at a 


taa 


101042 


96ORF310 


-2 


31423. .31539 


38 


gtaatatgattaggtctcctcttat 


ttg 


taa 


101043 


960RF311 


-2 


10438. .10554 


38 


cgcctttaaatcgttttaggtcact 


ate 


taa 


101044 


960RF312 


-2 


1390. .1506 


38 


gagaacaacacaaacattaacaaca 


ate 


taa 


101045 


960RF313 


-3 


33051. .33167 


38 


acgtcctgtttctagatcgtaatac 


at a 


tag 


101046 


960RF314 


-3 


25194. .25310 


38 


agcaaaccgttaaagataacattga 


ate 


taa 


101047 


960RF315 


-3 


6273. .6389 


38 


cattcttgctaacacgtcagattga 


ctg 


tga 


101048 


960RF316 


-3 


4281. .4397 


38 


ataattcgtattcattaatcattaa 


att 


tag 


101049 


960RF317 


1 


2260. .2373 


37 


atgactccttttctcatatttcttt 


ata 


taa 


101050 


960RF318 


2 


21230. .21343 


37 


atttcacacttttttagttgtctct 


ttg 


taa 


101051 


960RF319 


3 


18018. .18131 


37 


atactgagtcaccaatttaagct eg 


atg 


tag 


101052 


96ORF320 


3 


36972. .37085 


37 


attacagatatcctaagggtttccg 


att 


taa 


101053 


960RF321 


-1 


36302. .36415 


37 


ctcttgagttttttgacctaattta 


ate 


taa 


101054 


960RF322 


-1 


32606, .32719 


37 


ccataagttatttctccagttctat 


att 


taa 


101055 


960RF323 


-1 


11453 . .11566 


37 


ttaaaccgttcttttttatcaattc 


att 


tga 


101056 


960RF324 


-1 


7268. .7381 


37 


tactggttcgccccagtgaagttct 


ata 


tga 


101057 


960RF325 


-2 


32347. .32460 


37 


ttactgcatttgtatatggcgataa 


ate 


tag 


101058 


960RF326 


-2 


24682. .24795 


37 


acgtttattacgctcataaagccat 


ata 


tag 


101059 


960RF327 


-2 


23905. .24018 


37 


aaatggctgtggcgcttgaccatat 


gtg 


taa 


101060 


960RF328 


-2 


21460. .21573 


37 


agagcactaatacgtttttgttctt 


ctg 


tga 


101061 


960RF329 


-2 


21208. .21321 


37 


gacttaacttcttcgatattcatat 


ate 


tga 


101062 


96ORF330 


-2 


18085. .18198 


37 


ccagtcgacaccagcaaagtattct 


ttg 


tag 


101063 


960RF331 


-2 


8170. .8283 


37 


actttgagacgtcgtctgtctctct 


atg 


tag 


101064 


960RF332 


-2 


5971. .6084 


37 


caatttgttttccgttttctcttag 


ttg 


tag 


101065 


960RF333 


-3 


37632. .37745 


37 


accttgcttaatcaagtcgtaatta 


att 


tga 


101066 


960RF334 


-3 


29628. .29741 


37 


ctgagttagtgttgtaaaatgtcat 


ttg 


tag 


101067 


960RF335 


-3 


7164. .7277 


37 


ttagcggatatccgttttctagtaa 


ate 


taa 


101068 


960RF336 


1 


22903 . .23013 


36 


gtaaaaaaagacaatatgactatta 


ctg 


tga 


101069 


960RF337 


1 


43258. ,43368 


36 


taattgacgtggttattttttaggt 


ttg 


taa 


,101070 


960RF338 


2 


12668 . .12778 


36 


gaactggtggaatgggcatggaaca 


ate 


tag 


101071 


960RF339 


2 


28292. .28402 


36 


ttcactgctttaattcagttgctta 


ctg 


taa 


101072 


96ORF340 


2 


35396. .35506 


36 


ttcctaatgaacataagtcaacggt 


att 


tga 


101073 


960RF341 


3 


25428. .25538 


36 


actcgagaacaattagaaaaagcaa 


ttg 


tga 


101074 


960RF342 


-1 


40913 . .41023 


36 


tatctgggaaatttaatctaataaa 


ata 


tga 


101075 


960RF343 


-1 


39173 . .39283 


36 


tgccacattttagtgtcaggattga 


ttg 


taa 


101076 


960RF344 


-1 


37580. .37690 


36 


gggtctacctttaacgtcgtttcag 


ata 


taa 
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101077 


960RF345 


-1 


31556. .31666 


36 


ggattattctttctaataacttcaa 


ttg 


tga 


101078 


960RF346 


-1 


29972. .30082 


36 


ggctactccttatctaaaatataat 


ttg 


taa 


101079 


960RF347 


-1 


28787. .28897 


36 


ctgccaaagtctgtagcaattactt 


ttg 


tga 


101080 


960RF348 


-1 


21839. .21949 


36 


ttaaaatccgataaaataacattgc 


ctg 


tga 


101081 


960RF349 


-1 


3647. .3757 


36 


taaaacttccgaagttacccagcgt 


ttg 


tga 


101082 


96ORF350 


-2 


40801. .40911 


36 


accattccaattttgcccatatgat 


gtg 


tag 


101083 


960RF351 


-2 


38953 . .39063 


36 


tatcttttaaaattctcgtaatagc 


ate 


taa 


101084 


960RF352 


-2 


31585. .31695 


36 


tagctgtcatcactagtatttttga 


ate 


taa 


101085 


960RF353 


-2 


24550. .24660 


36 


atagtccgttttaccgcctcgtact 


att 


tag 


101086 


960RF354 


-2 


20083 . .20193 


36 


atcatcattttgatatttctcaaac 


at a 


tga 


101087 


960RF355 


-2 


991. .1101 


36 


gcatcttggcagtacgacgtaaaac 


ate 


tag 


101088 


960RF356 


-3 


38148. .38258 


36 


taagaaagcgtgcgcgat caaataa 


att 


tga 


101089 


960RF357 


-3 


8790. .8900 


36 


tgaagttatctagcgctatttttct 


ttg 


tag 


101090 


960RF358 


-3 


4458. .4568 


36 


ttcataaaagtattctttgtagtat 


atg 


tag 


101091 


960RF359 


1 


4666. .4773 


35 


ttatcaaaatatacaacttaattaa 


ate 


tag 


101092 


96ORF360 


1 


11569. .11676 


35 


ataaatttaccgaacatgaaaatga 


att 


tga 


101093 


960RF361 


2 


6122. .6229 


35 


ggaaaacaaattgatgttgtagtga 


ttg 


taa 


101094 


960RF362 


-1 


40418. .40525 


35 


ttcgtaggtgtcattacttctttaa 


ttg 


tag 


101095 


960RF363 


-1 


34358. .34465 


35 


gttttgcttgatttcgatttgttga 


atg 


tga 


101096 


960RF364 


-1 


20654 . .20761 


35 


ctatttccactgattccccatctaa 


atg 


tga 


101097 


960RF365 


-1 


8423. .8530 


35 


tcttttttagagttacgaggtttca 


att 


tag 


101098 


960RF366 


-1 


2402 . .2509 


35 


tgacgtatggcaacattttagatca 


ate 


taa 


101099 


960RF367 


-2 


36607 . .36714 


35 


aaaataaaaagccagtgccgaagca 


ctg 


tag 


101100 


960RF368 


-2 


27061. .27168 


35 


caaatcgtcctgcagcgttcaataa 


ate 


tag 


101101 


960RF369 


-2 


26470 . .26577 


35 


atgagttgttaagtttaccccaaat 


ate 


taa 


101102 


96ORF370 


-2 


10327. .10434 


35 


ccgtgccatcttctcggtataagta 


at a 


taa 


101103 


960RF371 


-2 


8650. .8757 


35 


gggtacgggttgttactgttgatat 


ate 


taa 


101104 


960RF372 


-3 


14382. .14489 


35 


gttcttttaattgatctactgttaa 


att 


taa 


101105 


960RF373 


-3 


8151. .8258 


35 


atgtttgttagtctctgtgtagtct 


atg 


taa 


101106 


960RF374 


-3 


5007. .5114 


35 


aaacgatttaagtggaacattattc 


at a 


taa 


101107 


960RF375 


2 


30563. .30667 


34 


cgattagaaatctttaaaaaaggac 


ttg 


tga 


101108 


960RF376 


-1 


19916. .20020 


34 


tctatgtcaggtaatttgtcattaa 


att 


taa 


101109 


960RF377 


-1 


9236. .9340 


34 


cttttctgttagtaattgtttttaa 


ate 


taa 


101110 


960RF378 


-1 


9026. .9130 


34 


actctttatctttagttgcttttaa 


ata 


tag 


101111 


960RF379 


-2 


28447. .28551 


34 


cttttgtgataataaagtttagtgc 


ttg 


tga 


101112 


96ORF380 


-3 


40329. .40433 


34 


ccatttaccttcttgagatgttgga 


ttg 


tga 


101113 


960RF381 


-3 


39801. .39905 


34 


caaaagatgaaggctttttccatac 


ttg 


taa 


101114 


960RF382 


-3 


33831. .33935 


34 


atgttgtttgtaactcgattaagtt 


ate 


tga 


101115 


960RF383 


-3 


33687. .33791 


34 


gttattacgtcttaatacttgtgtt 


gtg 


tag 


101116 


960RF384 


-3 


13530. .13634 


34 


tatacgcactagtactgatcactga 


ttg 


taa 


101117 


960RF385 


-3 


3843. .3947 


34 


tttgattgattgttctagttaagaa 


att 


taa 


101118 


960RF386 


1 


12256. .12357 


33 


ag t ca t aaagaagt t agcaat gt ga 


ttg 


tag 


101119 


960RF387 


2 


2207. .2308 


33 


tccaagactctttaactgttaactt 


ate 


tag 


101120 


960RF388 


2 


2519. .2620 


33 


attgttgaatttcgattgatctaaa 


atg 


tga 


101121 


960RF389 


2 


22517. .22618 


33 


agaagtaaaatgcgtaatgctttag 


atg 


tag 


101122 


96ORF390 


2 


27302 . .27403 


33 


ttccaaaattgggctaatagtgtag 


ctg 


taa 


101123 


960RF391 


2 


32384 . .32485 


33 


actaaaaaggttgagaaagctgtag 


atg 


taa 


101124 


960RF392 


2 


39287. .39388 


33 


aaaaacggtactgtagtatcaatca 


ate 


tag 


101125 


960RF393 


3 


18153. .18254 


33 


gtagtatatgccgactttgatttga 


atg 


taa 


101126 


960RF394 


3 


24189. .24290 


33 


tcagaccctaacattaacaaactag 


ttg 


tga 


101127 


960RF395 


-1 


15266. .15367 


33 


tcgataatttgtatagcttgtttta 


atg 


tag 


101128 


960RF396 


-2 


32239. .32340 


33 


ttttagtgaaagcatctagtgttga 


ata 


tag 


101129 


960RF397 


-2 


16123. .16224 


33 


ttatgtgtgcctatcatattaacaa 


ttg 


tag 


101130 


960RF398 


-2 


13648. .13749 


33 


tctttaactgaatgttgaatagcat 


ttg 


tag 


101131 


960RF399 


-2 


10987. .11088 


33 


acttctgtaggtattcttatatcaa 


ttg 


tga 


101132 


96ORF400 


-2 


3382. .3483 


33 


cttactggtaattcttcaaaattaa 


atg 


taa 


101133 


96ORF401 


-3 


40794. .40895 


33 


ccatatgatgtgaaagtgtttaaat 


ttg 


taa 


101134 


96ORF402 


-3 


39978. .40079 


33 


atattcctaaatcacttgaacctaa 


att 


tga 


101135 


96ORF403 


-3 


38607. .38708 


33 


atcttcagtgtaaaatcgacagcca 


atg 


tag 


101136 


96ORF404 


-3 


21288. .21389 


33 


cagacaccgtcttaagtccctttag 


ata 


taa 
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SEQUENCE INFORMATION FOR PHAGES MATCHING WITH TABLE 1 

M32695 

Bacteriophage PM2 nuclease cleavage site 
gi|166145|gb|M32695|BM2NCS [166145] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M32693 

Bacteriophage PM2 Hind III fragment 4 
gi|166144|gb|M32693|BM24HIND3 [166144] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M32693 

Bacteriophage PM2 Hind III fragment 4 
gi|166144|gb|M32693|BM24HIND3 [166144] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M32694 

Bacteriophage PM2 Hind III fragment 3 

gi| 166143 |gb|M32694|BM23HIND3 [ 1 66 143] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 



4* M26134 

U I Bacteriophage PM2 structural protein gene containing purme/pyrirnidine rich 
4* regions and anti-Z-DNA-IgG binding regions, complete cds 
f|l gi|289360|gb|M26 134|BM2PROTIV [289360] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, or 1 protein link ) 



J02452 

bacteriophage fi S'-terrninal region ma 
gi|2 1 5409|gbj J02452|PFITR3 [2 15409] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
AF020798 

Bacteriophage Chpl genome DNA, complete sequence 
gi|217761|dbj|D00624|BCPl [217761] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 12 protein links, or 1 genome link ) 
X72793 

Clostridium botulinum C phage BONT/C1, ANTP-139, ANTP-33, ANTP-17, ANTP-70 
genes and ORF-22 

gi|516171|emb|X72793|CBCBONT [516171] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 6 protein links, or 4 nucleotide neighbors ) 
X51464 

Clostridium botulinum D Phage C3 gene for exoenzyme C3 
gi| 14907|emb|X5 14641CBDPE3 [14907] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
D90210 

Bacteriophage c-st (from C. botulinum) Cl-tox gene for botulinum CI neurotoxin 
gi|217780idbj|D90210|CSTClTOX [217780] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
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S49407 

type D neurotoxin [bacteriophage d-16 phi, host = C. botulinum, type D, CB16, Genomic, 4087 nt] 
giI260238|gb|S49407|S49407 [260238] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
X53370 

Bacteriophage phi29 temperature sensitive mutant TS2(98) DNA polymerase gene 
gi|15733|emb|X53370|POTS298 [15733] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 7 nucleotide neighbors ) 
X53371 

Bacteriophage phi29 temperature sensitive mutant TS2(24) DNA polymerase gene 
gi| 1 573 1 |embiX5337 1 |POTS224 [1 573 1 ] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 7 nucleotide neighbors ) 
X05973 

Bacteriophage phi29 prohead RNA 

gi] 15680|emb|X05973|POP29PRO [15680] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, or 4 nucleotide neighbors ) 
V01 155 

Left end of bacteriophage phi-29 coding for 15 potential proteins Among 

these are the terminal protein and the proteins encoded by the genes 1, 2 (sus), 3, and (probably) 4 

gill56591emb|V01155|POP29B [15659] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 16 protein links, or 16 nucleotide neighbors) 
X73097 

Bacteriophage phi-29 left origin of replication 
gi|312194|emb|X73097|BP29ORIL [312194] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 5 nucleotide neighbors ) 
M 14430 

Bacteriophage phi-29 gene- 17 gene, complete cds 
gi|2 15321 |gb|Ml 4430IP29G1 7A [215321] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 6 protein links, or 8 nucleotide neighbors ) 
M 14431 

Bacteriophage phi-29 gene- 16 gene, complete cds 
gi|2 1 53 1 9igb|Ml 443 1 |P29G 1 6A [21531 9] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 7 nucleotide neighbors ) 
M20693 

Bacteriophage phi-29 DNA, 3' end 
gi|215343|gb|M20693|P29REPINB [215343] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 4 nucleotide neighbors ) 
M21016 

Bacteriophage phi-29 DNA, 5' end 

gi|2 1 5 342|gb|M2 1 0 1 6|P29REPINA [2 1 5342] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
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Ml 2456 

Bacteriophage phi-29 genes 9, 10 and 1 1 encoding p9 tail, incomplete, plO 
connector, complete, and pi 1 lower collar, incomplete, respectively 
gi|215338|gb|M12456|P29P9 [215338] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 3 protein links, or 2 nucleotide neighbors ) 
M 14782 

Bacillus phage phi-29 head morphogenesis, major head protein, head fiber 
protein, tail protein, upper collar protein, lower collar protein, pre-neck* 

appendage protein, morphogenesis(13), lysis, morphogenesis(15), encapsidation genes, complete cds 
gi|215323|gb|M14782|P29LATE2 [215323] 

(View GenBank report,FASTA report,ASN.l report, Graphical view.l MEDLINE link, 1 1 protein links, or 1 1 nucleotide neighbors) 
M26968 

Bacteriophage phi-29 (from Bacillus subtilis) proteins pi delta- 1 genes, complete cds, and the sus 1(629) mutation 
gi|341558|gb|M26968|P29PlD!A [341558] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
J02448 

Bacteriophage fl, complete genome 
gi| 166201 |gb|J02448jF!CCG [166201] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 10 protein links, 205 nucleotide neighbors, 
or 1 genome link ) 

M24832 

Bacteriophage f2 coat protein gene, partial cds 
gi|166228|gb|M24832|F2CRNACA [166228] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
J02451 

Bacteriophage fd, strain 478, complete genome 
gi|215394|gb|J02451|PFDCG [215394] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,5 MEDLINE links, 10 protein links, 204 nucleotide neighbors, 
or 1 genome link ) 

M34834 

Bacteriophage fir replicase gene, 5' end 
gi|166139|gb|M34834|BFRREGRA [166139] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l protein link, or 9 nucleotide neighbors ) 
M38325 

Bacteriophage fr replicase gene, 5 1 end 
gi|166137|gb|M38325|BFRREGR [166137] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 9 nucleotide neighbors ) 
M35063 

Bacteriophage fr coat protein replicase cistron (R region) RNA 
gi| 1 66 134|gb|M35063|BFRRCRRA [ 1 66 1 34] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 3 nucleotide neighbors ) 
S66567 

alpha-atrial natriuretic factor/coat protein=fusion polypeptide [human, 
bacteriophage fr, expression vector pFAN15, PlasmidSyntheticRecombinant, 510 nt] 
gi]435742|gb|S66567|S66567 [435742] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 15 nucleotide neighbors ) 



16.4 



X15031 

Bacteriophage fr RNA genome 
gi|15071iemb|X15031|LEBFRX [15071] 

(View GenBank report,FASTA report,ASN.l report 3 Graphical view,l MEDLINE link, 4 protein links, 9 nucleotide neighbors, 
or 1 genome link ) 

U51233 

Mus musculus neutralizing anti-RNA-bacteriophage fr immunoglobulin variable 
region light chain (IgM) mRNA, partial cds 
gi|1277150igb|U51233|MMU51233 [1277150] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 1669 nucleotide neighbors ) 
U51232 

Mus musculus neutralizing anti-RNA-bacteriophage fr immunoglobulin variable region heavy chain (IgM) mRNA, partial cds 
gi|1277148|gb|U51232|MMU51232 [1277148] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 1073 nucleotide neighbors ) 
U02303 

Bacteriophage Ifl, complete genome 
gi|3676280|gb!U02303|B2U02303 [3676280] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 10 protein links, or 1 genome link ) 

V00604 
Phage M 13 genome 

gi|14959|embiV00604|INM13X [14959] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,l MEDLINE link, 10 protein links, or 205 nucleotide 
neighbors ) 

A32252 

Synthetic bacteriophage Ml 3 protein III probe 
gi|1567340|emb|A32252|A32252 [1567340] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
A32251 

Synthetic bacteriophage Ml 3 protein EI probe 
gi|1567339|emb|A3225I|A32251 [1567339] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
Ml 2465 

Bacteriophage M13 mplO mutations in lac operon 
gi|215210|gb|M12465|M13LACMUT [215210] 

(View GenBank report,FASTA repor^ASN.l report,Graphical view,l MEDLINE link, or 215 nucleotide neighbors ) 
M24177 

Synthetic Bacteriophage M13 (clone M13.SV.B12) SV40 early promoter region DNA 
gi|209416|gb|M24177|SYNSVB12 [209416] 

(View GenBank report,F ASTA report,ASN. 1 report,Graphical view, 1 MEDLINE link, or 1 nucleotide neighbor ) 
M24176 

Synthetic Bacteriophage M13 (clone M13.SV.B1 1) SV40 early promoter region DNA 
gij209415|gb|M24176|SYNSVBl 1 [209415] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
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M24175 

Synthetic Bacteriophage M13 (clone M13.SV.8) SV40 early promoter region DNA 
gi|208806|gb|M24175|SYNM13SV8 [208806] 

(View GenBank report,FASTA report, ASN.l report } Graphical view,! MEDLINE link, or 242 nucleotide neighbors ) 
Ml 9979 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gi|207813|gb|M19979|SYN33M13M [207813] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 617 nucleotide neighbors ) 
M19565 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gi|207808|gb|M19565iSYN33M13H [207808] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 567 nucleotide neighbors ) 
M19564 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gi|207807|gb|M19564|SYN33M13G [207807] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, or 12 nucleotide neighbors ) 
M19563 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gi|207806[gb[M19563iSYN33M13F [207806] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 262 nucleotide neighbors ) 
M19561 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gij207804|gb|M19561|SYN33M13D [207804] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, or 27 nucleotide neighbors ) 
M19560 

Synthetic hybrids; recombinant DNA from bacteriophage Ml 3 and plasmid pHV33 
gi|207803igb|M19560|SYN33M13C [207803] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
M19559 

Synthetic hybrids; recombinant DNA from bacteriophage M13 and plasmid pHV33 
gi|207802|gb|M19559|SYN33M13B [207802] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 227 nucleotide neighbors ) 
M10568 

Bacteriophage M13 replicative form II, replication origin, specific nick location 
gi|215220|gb|Ml0568|Ml3ORIB [215220] 

(View GenBank report,FASTA report,ASN.l repor^Graphical view,l MEDLINE link, or 650 nucleotide neighbors ) 
M10910 

Bacteriophage M13 gene II regulatory region and M13sjl mutant 
gi|215209igblM10910|M13IIREG [215209] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,l MEDLINE link, or 72 nucleotide neighbors ) 
M38295 

Bacteriophage M13 Haelll restriction fragment DNA 
gi|2 1 5208|gb|M3 8295|M 13HAEIII [2 1 5208] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 67 nucleotide neighbors ) 
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E02067 

DNA encoding a part of Bacteriophage M13 tg 127 
gi|217031 l|dbj|E02067|E02067 [217031 1] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
J02467 

Bacteriophage MS2, complete genome 
gi|215232|gb|J02467|MS2CG [215232] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,8 MEDLINE links, 4 protein links, 20 nucleotide neighbors, 
or 1 genome link ) 

AJ004950 

Bacteriophage PI ban gene 
gi|3688226|emb|AJ01 1592|BP101 1592 [3688226] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 protein link ) 
U88974 

Bacteriophage PI structural lytic transglycosylase (orf47), pep44b (orf44b), 

pep44a (orf44a), and pep43 (orf43) genes, complete cds; and pep42 (orf42) gene, partial cds 

gi|2661099|gb|AF035607|AF035607 [2661099] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,5 protein links, or 1 nucleotide neighbor ) 

AJ000741 
Bacteriophage PI darA operon 
gi|2462938|emb|AJ00074 1 |BPAJ764 1 [2462938] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 10 protein links, or 31 nucleotide neighbors) 
X01828 

Bacteriophage PI recombinase gene cin 
gi|15133jemb|X01828|MYPlCIN [15133] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 
X98146 

Bacteriophage PI DNA sequence around the Op88 operator 
gi|1359513|emb|X98146|BP10P880P [1359513] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 nucleotide neighbor ) 
S61175 

imml operon: icd=cell division repressor, antl=antirepressor {promoters 
P51a, P51b} [bacteriophage PI, Genomic, 728 nt] 
gi|385908|gb|S61 175|S61 175 [385908] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link; or 3 nucleotide neighbors ) 

X87824 
Bacteriophage PI gene 26 
gi|861164[emb|X87824|XXBPlG26 [861164] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 protein link ) 
X15638 

Phage PI DNA for lytic replicon containing promoter P53 and two open reading frames 
gi|15735|emb|X15638|PPlLREP [15735] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 3 protein links, or 24 nucleotide neighbors ) 
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X17512 

Bacteriophage PI DNA for immunity region imml 
gi]15479|emb|X17512|PlIMMUNIY [15479] 

(View GenBank report,FASTA report,ASN.l report,Graphicai view,2 MEDLINE links, or 4 nucleotide neighbors ) 
XI 6005 

Bacteriophage PI cl gene for Pic 1 repressor protein 
gi|15477|emb|X16005|PlCl [15477] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 
X03453 

Bacteriophage PI ere gene for recombinase protein 
gi|15135|emb]X03453|MYPlCRE [15135] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 12 nucleotide neighbors ) 
X06561 

Bacteriophage PI cl gene 5'-region 
gi|15128|emb|X06561|MYP!Cl [15128] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 4 protein links, or 6 nucleotide neighbors ) 
V01534 

Bacteriophage PI genome fragment (IS2 insertion spot). This regions contains 

four unidentified reading frames and is known as insertion hot spot for IS2 insertion sequences 

gi|15118|emb|V01534|MYOVPl [15118] 

(View GenBank report,FASTA report,ASN.l report,Graphicai view,l MEDLINE link, 4 protein links, or 3 nucleotide neighbors ) 

X56951 

Bacteriophage PI gene 10 
gi|406728|emb|X5695 1 |BPP 1GP 1 0 [406728] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 3 protein links, or 1 nucleotide neighbor ) 
K02380 

Bacteriophage PI replication region including repA, parA, and parB genes and 
incA, incB, and incC incompatibility deterniinants 
gi|215652|gb|K02380|PPlREP [215652] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,5 MEDLINE links, 4 protein links, or 8 nucleotide neighbors ) 
X87674 

Bacteriophage PI lydA & lydB genes 
gi|974763|embjX87674|BACPlLYD [974763] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 

X87673 
Bacteriophage PI gene 17 
gi|974761|emb|X87673|BACP117 [974761] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
M16618 

Bacteriophage PI cl repressor binding sites 
gi|215600|gb|M16618|PPlCl [215600] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 3 nucleotide neighbors ) 
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SEG_PP1CIN 

Bacteriophage PI cin gene encoding recombinase, cixL recombination site, and 5' end of C invertible element 
gi|215607|gb||SEG_FPlCIN [215607] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
K03173 

Bacteriophage PI C invertible element, right end, and cixR recombination site 
gii215606|gb|K03173|PPlCIN2 [215606] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
215605 

Bacteriophage PI cin gene encoding recombinase, cixL recombination site, and 5* end of C invertible element 
gil215605|lcl|X01828 [215605] 

(View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
M25470 

Bacteriophage PI tail fiber protein gene, complete cds 
gi|341349|gb|M25470|PPlTFPR [341349] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 3 protein links, or 3 nucleotide neighbors ) 
M34382 

Bacteriophage PI sim region proteins, complete cds 
gi|215661|gb|M34382|PPlSIM [215661] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
M8I956 

Bacteriophage PI R protein (R) gene, complete cds 
gi|215658|gb|M81956|PPlRP [215658] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 4 nucleotide neighbors ) 
M37080 

Bacteriophage PI mini-Pi plasmid origin of replication 
gi|215657|gb|M37080|PPlREPOR [215657] 

(View GenBank report,FASTA report, ASN.l report,Graphical view.l MEDLINE link, or 46 nucleotide neighbors ) 
M27041 

Bacteriophage PI ref gene, complete cds 
gi|215650|gb|M27041|PPlREF [215650] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
L01408 

Bacteriophage PI partition protein (parB) gene, 3 1 end • 
gi!215642|gb|L01408|PPlPARB [215642] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l protein link, or 41 nucleotide neighbors ) 

SEG_PP1PAR 
Bacteriophage miniplasmid PI parA gene, 5' end 
gi|215639igb||SEGJ>PlPAR [215639] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 2 protein links, or 48 nucleotide neighbors ) 
M36425 

Bacteriophage miniplasmid PI parB gene, 3' end 
gii215638|gb|M36425|PPlPAR2 [215638] 

(View GenBank report,FASTA report^ASN.l report, or Graphical view) 
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M36424 

Bacteriophage miniplasmid PI parA gene, 5' end 
giI215637|gb|M36424|PPlPARl [215637] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
M11129 

Bacteriophage PI miniplasmid origin of replication region 
gi|215632|gb|M11129|PP10RIM [215632] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 43 nucleotide neighbors ) 
M25414 

Bacteriophage PI cl repressor binding site, operator 88 (Op88) 
gi|215631|gb|M25414|PP10P88A [215631] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 3 nucleotide neighbors ) 
M25413 

Bacteriophage PI cl repressor binding site, operator 68 (Op68) 
gi|215630|gb|M25413|PPlOP68A [215630] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
M25412 

Bacteriophage PI cl repressor binding site, operator 21 (Op21) 
gi|215629|gb|M25412|PP10P21A [215629] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M10510 

Bacteriophage PI recombination site loxR 
gi|215628|gb|M10510|PPlLOXR [215628] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M10287 

Bacteriophage PI loxP X loxP recombination site 
gi|215627|gb|M10287|PPlLOXPX [215627] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 13 nucleotide neighbors ) 
M10494 

Bacteriophage PI recombination site loxP 
gi|215626|gb|M10494|PPlLOXP [215626] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 134 nucleotide neighbors ) 
M10511 

Bacteriophage PI recombination site loxL 
gi|215625igb|M10511|PPlLOXL [215625] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M10512 

Bacteriophage PI recombination site loxB 
gi|215624|gb|M10512|PPlLOXB [215624] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
M10145 

Bacteriophage PI genome fragment with recombination site loxP 
gi|215623|gb|M10145|PPlCREX [215623] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, or 21 nucleotide neighbors ) 
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M13327 

Bacteriophage PI Cin recombinase activated cross over site, junction IV, clone pSHI326 
gi|215622|gb|M13327|PPlCN26IV [215622] 

(View GenBank report,FASTA report.ASN.1 report,Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
M13325 

Bacteriophage P 1 Cin recombinase activated cross over site, junction II, clone pSHI326 
gi|215621|gb|M13325|PPlCN26II [215621] 

(View GenBank report,FASTA report.ASN.1 report,Graphical view.l MEDLINE link, or 1401 nucleotide neighbors 
M13323 

Bacteriophage PI Cin recombinase activated cross over site, junction IV, clone pSHI325 
gi|215620|gb|M13323|PPlCN25IV [215620] 

(View GenBank report,FASTA report,ASN.l report.Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
M13321 

Bacteriophage PI Cin recombinase activated cross over site, junction II, clone pSHI325 
gi|215619|gb|M13321|PPlCN25II [215619] 

(View GenBank report,FASTA report.ASN.1 report,Graphical view.l MEDLINE link, or 1058 nucleotide neighbors ) 
M13324 

Bacteriophage PI Cin recombinase activated cross over site, junction I, clone pSHI326 
gi|215618|gb|M13324|PPlCIR26I [215618] 

(View GenBank report,FASTA report^.SN. 1 report.Graphical view.l MEDLINE .link, or 7 nucleotide neighbors ) 
M13319 

Bacteriophage PI Cin recombinase activated cross over site, right junction, clone pSHI327 
gi|215617|gb|M13319|PPlCIN27R [215617] 

(View GenBank report.FASTA report,ASN. 1 report, Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
M13320 

Bacteriophage PI Cin recombinase activated cross over site, junction I, clone pSHI325 
gi|2 1 56 1 6|gb|M13320|PP 1 CIN25I [2 1 56 16] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
M13318 

Bacteriophage PI Cin recombinase activated cross over site, left junction, clone pSHI324 
gi|2l5615|gb|M13318|PPlCIN24L [215615] 

(View GenBank report,FASTA reporUSN.l report,Graphical view.l MEDLINE link, or 1370 nucleotide neighbors ) 
M13317 

Bacteriophage PI Cin recombinase activated cross over site, right junction, clone pSHI323 
gi|215614|gb|M13317|PPlCIN23M [215614] 

(View GenBank report,FASTA report^SN.l report,Graphical view.l MEDLINE link, or 1055 nucleotide neighbors ) 
M13316 

Bacteriophage PI Cin recombinase activated cross over site, left junction, clone pSHI323 
gi|215613|gb|M13316|PPlCIN23L [215613] • 

(View GenBank report,FASTA report.ASN.1 report,Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
M13315 

Bacteriophage PI Cin recombinase activated cross over site, right junction, clone pSHI322 
gi|215612|gb|M13315|PPlCIN22R [215612] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 7 nucleotide neighbors ) 
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M13314 

Bacteriophage PI Cin recombinase activated cross over site, left junction, clone pSHI322 
gi|215611|gb|M13314|PPlCIN22L [215611] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1401 nucleotide neighbors ) 
M13313 

Bacteriophage PI Cin recombinase activated cross over site, right junction, clone pSHI321 
gi|215610|gb|M13313|PPlCIN21R [215610] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, or 7 nucleotide neighbors ) 
M13312 

Bacteriophage PI Cin recombinase activated cross over site, left junction, clone pSHI321 
gi|215609|gb|M13312|PPlCIN21L [215609] 

(View GenBank reportJFASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1058 nucleotide neighbors ) 
Ml 6568 

Bacteriophage PI c4 repressor gene, complete cds 
gi|215603|gb|M16568|PPlC4 [215603] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors 
M13326 

Bacteriophage PI Cin recombinase activated cross over site, junction III, clone pSHI326 
gi|215602|gb|M13326|PPlC26in [215602] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 1192 nucleotide neighbors ) 
M13322 

Bacteriophage PI Cin recombinase activated cross over site, junction III, clone pSHI325 
gi|215601|gbjM13322|PPlC25in [215601] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1231 nucleotide neighbors ) 
J05651 

Bacteriophage PI modulator protein (bof) gene, complete cds 
gi|215598|gb|J05651iPPlBOFYl [215598] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors 
M33224 

Bacteriophage PI regulatory protein (bof) gene, complete cds 
gi|215596|gb|M33224|PPlBOFFO [215596] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors 
M10288 

E.coli/bacteriophage P 1 loxR recombination site 
gi|146647|gbjM10288|ECOLOXR [146647] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, 1 MEDLINE link, or 3 nucleotide neighbors ) 
M 10289 

E.coli/bacteriophage PI loxL recombination site 
gi|146646|gb|M10289|ECOLOXL [146646] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M10290 

Exoli loxB site, which can recombine with bacteriophage PI loxP site 
gi|146645|gb|M10290|ECOLOXB [146645] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, 1 MEDLINE link, or 2 nucleotide neighbors ) 
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M10287 

Bacteriophage PI loxP X loxP recombination site 
gi|215627|gb|M10287|PPlLOXPX [215627] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, or 13 nucleotide neighbors ) 
M74046 

Bacteriophage P 1 pacA and pacB genes, complete cds 
gi|215634|gb|M74046|PPlPACAB [215634] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, 1 MEDLINE link, or 2 protein links ) 
M95666 

Bacteriophage PI gene 10, doc and phd genes, complete cds 
gi|463276|gb|M95666|PPlPHDDOC [463276] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 4 protein links, or 1 nucleotide neighbor ) 
M25604 

Bacteriophage Q-beta mutated autonomously replicating sequence MDV1 RNA fragment 
gi|556359|gb|M25604|PQBARSMUT [556359] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 8 nucleotide neighbors ) 
V00643 

first half of the phage Q-beta gene for coat protein 
gi| 1 5088|emb| V00643JLEQBET [1 5088] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
M25167 

Bacteriophage Q-beta RNA fragment recovered from replicase binding complex 
gi|556362|gb|M25 167|PQBREPLICB [556362] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M24876 

Bacteriophage Q-beta replicase RNA, 5' end 
gi|556360|gb|M24876|PQBREPLICA [556360] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
M25444 

Synthetic bacteriophage Q-beta DNA 

gi|2091 18|gb|M25444|SYNPQBTERM [2091 18] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 8 nucleotide neighbors ) 
M25463 

Bacteriophage Q-beta self-replicating microvariant (+) RNA 
gi|532489|gb|M25463|PQBMVSRRNA [532489] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
M25014 

Bateriophage Q-beta RNA replicase gene, 5 f end, and maturation protein gene, 3' end 
gi|294316|gb|M25014|PQBREPLC [294316] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
M25065 

Bacteriophage Q-beta RNA sequence with putative stem loop 
gi|294315|gb|M25065|PQBLOOP [294315] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view.l MEDLINE link, or 3 nucleotide neighbors) 
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Ml 0265 

Bacteriophage Q-beta RNA molecule with the ability to replicate extracellularly 
gij215726|gb|M10265|PQBRNA [215726] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 8 nucleotide neighbors ) 
M24815 

Bacteriophage Q-beta specified replicase subunit RNA, 
gi|215725|gb|M24815|PQBREPL [215725] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 4 nucleotide neighbors ) 
M25461 

Bacteriophage Q-beta plus-strand RNA, 5' terminus 
gi|215724|gb|M25461|PQBPS5E [215724] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
M25462 

Bacteriophage Q-beta plus-strand RNA, 3* terminus 
gi|2 1 5 723|gb|M25462|PQBPS3E [2 1 5723] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 8 nucleotide neighbors ) 
M24871 

Bacteriophage Q-beta nanovariant WSIII RNA 
%\\2 1 5722 jgbjM2487 1 |PQBNVWSIC [2 1 5722] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M24870 

Bacteriophage Q-beta nanovariant WSII RNA 
gi|2 1 5 72 1 |gb|M24870|PQBNVWSIB [2 1 572 1 ] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M24869 

Bacteriophage Q-beta nanovariant WSI RNA 
gi|215720|gb|M24869|PQBNVWSIA [215720] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M10495 

Coliphage Q-beta MDV-1(+) RNA 
gi|215719|gb|M10495|PQBMDVlA [215719] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 10 nucleotide neighbors ) 
• J02484 

bacteriophage qbeta coat protein cistron first half 
gi|215717|gb|J02484|PQBCP5 [215717] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
M57754 

Bacteriophage Q-beta minus strand RNA, 5 f terminus 
gi|215716|gb|M57754|PQBBMS5E [215716] 

(View GenBank report,FASTA report, ASN J report,Graphical view, or 8 nucleotide neighbors ) 
M24297 

Bacteriophage Q-beta 5'-terminal region of the minus strand 
gi|215715|gb|M24297iPQB5END [215715] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 8 nucleotide neighbors ) 
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Ml 0695 

Bacteriophage Q-beta, MDV-1 RNA 
gi|215714|gb|M10695|PQBlIR [215714] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,2 MEDLINE links, or 12 nucleotide neighbors ) 
M24827 

Bacteriophage R17 A protein gene, 5 f end 
gi|216078|gb|M24827|R17RNACIS [216078] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 5 nucleotide neighbors ) 
M24829 

Bacteriophage R17 coat protein gene, 5* end 
gi|216075|gb|M24829|R17CP5 [216075] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 5 nucleotide neighbors ) 
J02488 

bacteriophage rl7 rna synthetase initiation site 
gi|216080|gb|J02488|R17RNASYN [216080] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 MEDLINE links, 2 protein links, or 6 nucleotide neighbors ) 
J02487 

bacteriophage r 1 7 coat protein initiation site 
gi|216073|gb|J02487|R17COATP [216073] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view, or 1 MEDLINE link ) 
J02486 

bacteriophage r!7 a protein initiation site 
gi|216071|gb|J02486|R17APROT [216071] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
M24826 

Bacteriophage R17 coat protein RNA fragment 
gi|216077|gb|M24826|R17CPRAA [216077] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 7 nucleotide neighbors ) 
M24296 

Bacteriophage R17 3'-terrninal fragment A RNA 
gi|2 1 6070|gb|M24296|Rl 73TFA [2 1 6070] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 9 nucleotide neighbors ) 
1TFN 

structure refinement for a 24-nucleotide rna hairpin, nmr, rrnnirnized average 

structure ribonucleic acid, hairpin, bacteriophage r!7 moHd: 1; molecule: rl7c; chain: null; engineered: yes 
gi|1942336|pdb|lTFN| [1942336] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 structure link ) 
1RPEA 

rna (5*-d(gpgpgpapcpupgpapcpgpapupcpapcpgp cpapgpupcpupapu-3') (24-mer rna 
hairpin coat protein binding site for bacteriophage rl7) (nmr, rrnnirnized average structure) 
gi|1421020|pdb|lRHT| [1421020] 

(View GenBank report,FASTA report,ASN.l report, Graphical view, or 1 structure link ) 
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Ml 4428 

Bacteriophage SI 3 circular DNA, complete genome 
gi|216089|gb|M14428|S13CG [216089] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 12 protein links, 26 nucleotide neighbors 
or 1 genome link ) 

J05393 

Bacteriophage Tl DNA N-6-adenine-methyltransferase (M.T1) gene, complete cds 
gi|166163|gb|J05393|BTlNAMTA [166163] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
L46845 

Bacteriophage T2 frd3, frd2 genes, complete cds 
gi|951387|gb|L46845|PT2FRD32G [951387] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,2 protein links, or 17 nucleotide neighbors ) 
L43611 

Bacteriophage T2 fibritin (wac) gene, complete cds 
gi!903869|gb|L43611|PT2WAC [903869] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 4 nucleotide neighbors ) 
M24812 

Bacteriophage T2 secondary structure RNA sequence 
gi|215796|gbjM24812|PT2RNA [215796] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 4 nucleotide neighbors ) 
M22342 

Bacteriphage T2 DNA-(adenine-N6)methyltransferase (dam) gene, complete cds 
gi!215792igb|M22342|PT2DAM [215792] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
S57515 

orf 61.2 {intergenic region between 41 and 61} [bacteriophage T2, Genomic, 323 nt] 
gi|298524|gb|S57515|S57515 [298524] 

(View GenBank report,FASTA report^SN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
X05312 

Bacteriophage T2 gene 38 for receptor recognizing protein 
gi|15197|emb|X05312|MYT2G38 [15197] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
X04442 

Bacteriophage T2 gene 37 for receptor recognizing protein 
gi|15195|embjX04442|MYT2G37 [15195] 

(View GenBank report,FASTA report>ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
X12460 

Bacteriophage T2 gene 32 mRNA for single-stranded DNA binding protein 
gi|15192|emb|X124601MYT2G32 [15192] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 14 nucleotide neighbors ) 
X57797 

Bacteriophage T2 gene for gp!2 
gi|14875|emb|X56555|BT2GP12 [14875] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 2 nucleotide neighbors ) 
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X01755 

Bacteriophage T2 tail fiber gene 36 
gi|15189|emb|X01755|MYT2F36 [15189] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
M 14784 

Bacteriophage T3 strain amNG220B right end, tail fiber protein, lysis protein and DNA packaging proteins complete cds 

gi| 2 1 58 1 0|gblM 147841PT3RE [215810] , P 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view, 1 MEDLINE link, 9 protein links, or 1 0 nucleotide neighbors ) 

SEG_PT3RNAPOL 
Bacteriophage T3 RNA polymerase III gene, 5' end 
gi|710559|gbj|SEG_PT3RNAPOL [710559] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view, 1 MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
M22610 

Bacteriophage T3 RNA polymerase III gene, 3' end 

gi|340722|gb|M22610|PT3RNAPOL2 [340722] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

M22609 

Bacteriophage T3 RNA polymerase III gene, 5' end 

gi|340721|gb|M22609|PT3RNAPOLl [340721] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

X05031 

Bacteriophage T3 gene region 1-2.5 with primary origin of replication 
gi| 157 19jemb|X0503 1 |POT30RI [ 1 571 9] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 1 protein links, or 5 nucleotide neighbors ) 
X03964 

Bacteriophage T3 early control region pos. 308-810 from genome left end 
gi|15718!emb|X03964|POT3EP [15718] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view,2 MEDLINE links, or 20 nucleotide neighbors ) 
X17255 

Bacteriophage T3 gene 1 to gene 11 

gi| 15682|emb|X17255|POT3 1 1 1G [1 5682] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view,4 MEDLINE links, 36 protein links, 17 nucleotide neighbors, 
or 1 genome link ) 

XI 5840 

Phage T3 gene 10 

gi|15625|emb|X15840|PODT3G10 [15625] 

(View GenBank report,FAS~TA report, ASN. 1 report,Graphical view,l MEDLINE link, or 3 nucleotide neighbors ) 
X02981 

Bacteriophage T3 gene 1 for RNA polymerase 
gi|15561|emb|X02981|PODOT3P [15561] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view.l MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 
J02503 

bacteriophage t3 5' end, terminally redundant sequence (trs) 
gi|2 1581 6|gb| J02503 |PT3TRS 1 [215816] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
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SEG_PT3TRS 

bacteriophage t3 5* end, terminally redundant sequence (trs) 
gi|2 1 5 8 1 8|gb||SEG_PT3TRS [215818] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
J02504 

bacteriophage t3 3* end, terminally redundant sequence (trs) 
gi|215817|gb|J02504|PT3TRS2 [215817] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

HYPERLINK http://www.rs.noda.sut.ac.jp/-kunisawa h t tp://www.rs.noda.sut.acjp/~kunisawa 
Bacteriophage T4 genomic database compiled by Arisaka et al. 

X95646 

Bacteriophage T5 DNA for region 60.5%-71% of the T5 genome 
gi|2791557|emb|AJ001191|BTJ001191 [2791557] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,7 MEDLINE links, 12 protein links, or 6 nucleotide neighbors ) 
X56847 

..^ Bacteriophage T5 genomic region encoding early genes D 1 0-D 1 5 
W gi| 15407|emb|X12930|MYT5D10 [15407] 

tfl (View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 5 protein links, or 4 nucleotide neighbors ) 
AF039886 

Bacteriophage T5 subclone T5.5.3r5.18r, single pass sequence, genomic survey sequence 
||| gi|28 1 1 1 54|gb|AF039886|AF039886 [28111 54] 

I « s (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
^ AF039885 

* Bacteriophage T5 subclone T5.40f,41f, single pass sequence, genomic survey sequence 
^ gi|281 1 153|gb|AF039885|AF039885 [281 1 153] 

rtl (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
ffj AF039884 

■ w p Bacteriophage T5 subclone T5.26.fr, single pass sequence, genomic survey sequence 
^ gi|2811152|gb|AF039884|AF039884 [2811152] 

(View GenBank report,FASTA report, ASN. 1 report, or Graphical view) 

AF039883 

Bacteriophage T5 subclone 10-T5.5.7F, single pass sequence, genomic survey sequence 
gi|2811151|gb|AF039883|AF039883 [2811151] 

(View GenBank report,FASTA report, ASN. 1 report, or Graphical view) 
AF039882 

Bacteriophage T5 subclone 41-T5.5.4BF, single pass sequence, genomic survey sequence 
gi|28 1 1 1 50|gb| AF039882|AF039882 [28111 50] 

(View GenBank report,FASTA report, ASN. 1 report, or Graphical view) 
AF039881 

Bacteriophage T5 subclone 39-T5.5.4aF, single pass sequence, genomic survey sequence 
gi|28 1 1 149|gb|AF03988 1 1AF03988 1 [28 1 1 149] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view, or 1 
nucleotide neighbor ) 
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AF039880 

Bacteriophage T5 subclone 19-T5.72r, single pass sequence, genomic survey sequence 

gi|28 1 1 148|gb|AF039880|AF039880 [28 1 1 148] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF039879 

Bacteriophage T5 subclone 18-T5.7.2F, single pass sequence, genomic survey sequence 
gi|2811147|gb|AF039879|AF039879 [2811147] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) * 
AF039878 

Bacteriophage T5 subclone 1 1-T5.5.7R, single pass sequence, genomic survey sequence 
gi|28 1 1 146|gb|AF039878|AF039878 [2811 146] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 2 
nucleotide neighbors ) 

AF039877 

Bacteriophage T5 subclone T5.4FR, single pass sequence, genomic survey sequence 
gi|2811145|gb|AF039877|AF039877 [2811145] 
1*3 (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

^AF039876 

Bacteriophage T5 subclone 22-T5.16R, single pass sequence, genomic survey sequence 
W gi|2811144|gb|AF039876|AF039876 [2811144] 

=|~ (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
|||AF039875 

f|s Bacteriophage T5 subclone 21-T5.16R, single pass sequence, genomic survey sequence 
^ gi|2811143|gb|AF039875|AF039875 [2811143] 

T , (View GenBank reportJFASTA report,ASN. 1 report, or Graphical view) 
^AF039874 

W Bacteriophage T5 subclone 21-T5.16F, single pass sequence, genomic survey sequence 
111 gi|2811142|gb|AF039874|AF039874 [2811142] 

(View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

^AF039873 

Bacteriophage T5 subclone 09-T5.6F, single pass sequence, genomic survey sequence 
gi|28111411gb|AF039873|AF039873 [2811141] 
, (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

AF039872 

Bacteriophage T5 subclone 09-T5.6R, single pass sequence, genomic survey sequence 
gi|28 1 1 140|gb|AF039872| AF039872 [28 1 1 140] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 2 nucleotide neighb 
AF039871 

Bacteriophage T5 subclone 04-T5.26.R, single pass sequence, genomic survey sequence 
gi|2811139|gb|AF039871|AF039871 [2811139] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
AF039870 

Bacteriophage T5 subclone 13-T5.42F, single pass sequence, genomic survey sequence 
gi|281 1 138|gb|AF039870|AF039870 [2811 138] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
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X69460 

Bacteriophage T5 ltf gene for L-shaped tail fibers 
gi|15415|emb|X69460|MYT5LTF [15415] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 1 protein link, or 4 nucleotide neighbors ) 
X03402 

Bacteriophage T5 D15 gene for 5 ! exonuclease 
gi|15413|emb|X03402]MYT5EXOG [15413] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
Z11972 

Bacteriophage T5 tRNA-Tyr, tRNA-Glu, tRNA-Trp, tRNA-Phe, tRNA-Cys and 
tRNA-Asn genes, and ORFs 91aa, 90aa, 42aa and 172aa 
gi|15795|emb|Z11972|T56TRNAG [15795] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 4 protein links, or 3 nucleotide neighbors ) 
X03898 

Bacteriophage T5 genes for tRNA-His, -Ser and -Leu 
gi|15786|emb|X03898|STT5RNl [15786] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 2 MEDLINE links ) 
X04177 

Bacteriophage T5 gene for transfer RNA-Gln 
gi|15421|emb|X04177|MYT5TRNQ [15421] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
X03899 

Bacteriophage T5 genes for tRNA-Val, -Lys, -fMet, -Pro and -Ile3 
giil5787|emb|X03899|STT5RN2 [15787] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
X03798 

Bacteriophage T5 gene for tRNA-Asp (GUC) 
gi!15472iemb|X03798|NCT5TRDG [15472] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
Y00364 

Bacteriophage T5 tRNA gene cluster (27.8%-22.4%) 
giil5420|emb|Y00364|MYT5TRN [15420] 

(View GenBank reportJFASTA repor^ASN.l report,Graphical view,l MEDLINE link, or 13 nucleotide neighbors ) 
X03140 

Bacteriophage T5 DNA with rho-dependent transcription terminator (Hind III-P fragment) 
gi|15417iemb|X03140|MYT5RHO [15417] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 

Z35070 

Bacteriophage T6 DNA 

gii535228|emb|Z35074|MYEREGBT6 [535228] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
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AF060870 

Coliphage T6 small subunit distal tail fiber (gene 36) gene, partial cds; and large subunit distal tail fiber (gene 37) and tail fiber 
adhesin (gene 38) genes, complete cds 
gi|3676458|gb|AF052605|AF052605 [3676458] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 protein links, or 2 nucleotide neighbors ) 
Z35072 

Bacteriophage T6 DNA encoding ORF19.1 gene and gl9 gene 
gi|535232iemb|235072|MYTAILT6 [535232] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
X12488 

Bacteriophage T6 gene 32 mRNA for single-stranded DNA binding protein 
gi|15843|emb|X12488iMYT6G32 [15843] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 14 nucleotide neighbors ) 
Z78095 

Bacteriophage T6 DNA (1506 bp) 
gi| 1488562|emb|Z78095|BPHZ78095 [1488562] 
^ (View GenBank report,FASTA report, ASN. 1 report,Graphical view, 1 protein link, or 4 nucleotide neighbors ) 

€1 Z35079 

4K Bacteriophage T6 DNA for Ip5, Ip6 

Lfl gi|535215|emb|Z35079|MY57BT6 [535215] 

4!;: (View GenBank report,FASTA report, ASN.l report, Graphical view,l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
jjj X68725 

i: \ E.coli bacteriophage T6 gene for beta-glucosyl-HMC-alpha-glucosyl-transferase 
- . gi|296439|emb|X68725|ECT6 [296439] 

Lv ( VieW GenBank report,FASTA report,ASN. 1 report,Graphical view, 1 MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
III X69894 

Cl Bacteriophage T6 alt gene for ADP-Ribosyltransferase 
fil gi| 15422|embiX69894|MYT6ADP [15422] 

:|| (View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
^ L46846 

Bacteriophage T6 frd3, frd2 genes, complete cds 
gi|95 1390|gb|L46846|PT6FRD32G [951390] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view, or 2 protein links ) 
M27738 

Bacteriophage T6 translational repressor protein (regA), complete cds 
gi|215993|gb|M27738|PT6REGA [215993] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 5 nucleotide neighbors ) 
M38465 

Bacteriophage T6 DNA ligase gene, complete cds 
gi|215991|gb|M38465|PT6LIG55 [215991] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
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VOl 146 
Genome of bacteriophage 17 
gi|431187|emb|V01146|T7CG [431187] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,13 MEDLINE links, 60 protein links, 105 nucleotide 
neighbors, or 1 genome link ) 

X60322 

Bacteriophage alpha3 genes A, B, K, C, D, E, J, F, G, H 
gi| 14775|emb|X60322|BACALPHA [14775] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 10 protein links, 22 nucleotide neighbors 
or 1 genome link ) ° 

X13332 

Bacteriophage alpha3 DNA for origin of replication 
gi|15093iemb|X13332|MIA3ORPL [15093] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
X12611 

Bacteriophage alpha3 gene for protein A part., finger domain 
gii 1 5092|emb|X 12611 |MIA3 AFIN [ 1 5092] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 1 protein link, or 6 nucleotide neighbors ) 
X15721 

Bacteriophage alpha3 deletion mutation DNA for the origin region (-ori) of replication 
gi|14774|embiX15721|BA3DMOR9 [14774] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 11 nucleotide neighbors ) 
XI 5720 

Bacteriophage alpha3 deletion mutant DNA for the origin region (-ori) of replication 
gi|14773|emb|X15720|BA3DMOR8 [14773] 

(View GenBank report,FASTA report, ASN.l report,Graphical view.l MEDLINE link, or 1 nucleotide neighbor ) 
X15719 

Bacteriophage alpha3 insertion mutant DNA for the origin region (-ori) of replication 
gi|14772|emb|X15719|BA3DMOR7 [14772] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, or 10 nucleotide neighbors ) 
X15718 

Bacteriophage alpha3 deletion mutation DNA for origin region (-ori) of replication 
gi|14771|emb|X15718|BA3DMOR6 [14771] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 1 nucleotide neighbors ) 
X15717 

Bacteriophage alpha3 deletion mutatnt DNA for origin region (-ori) of replication 
gi| 14770|emb|Xl 57 1 7|BA3DMOR5 [14770] 

(View GenBank report,FASTA report, ASN. 1 report,Graphical view, 1 MEDLINE link, or 9 nucleotide neighbors ) 
X15716 

Bacteriophage alpha3 deletion mutant DNA for origin region (-ori) of replication 
gi| 14769|emb|X 1 57 1 6|B A3DMOR4 [14769] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 10 nucleotide neighbors ) 
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X15715 

Bacteriophage alpha3 deletion mutant DNA for origin region (-ori) of of replication 
gi|14768|emb|X15715|BA3DMOR3 [14768] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 1 nucleotide neighbors ) 
X15714 

Bacteriophage alpha3 deletion mutant DNA for origin region (-ori) of replication 
gi|14767iemb|X15714iBA3DMOR2 [14767] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l &EDLINE link, or 1 1 nucleotide neighbors ) 
X15713 

Bacteriophage alpha3 deletion mutant DNA for the origin region (-ori) of replication 
gi|14766|emb|X15713|BA3DMORl [14766] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 1 nucleotide neighbors ) 
X62059 

Bacteriophage alpha3 origin of cDNA synthesis (oriGA) 
gi| 14763|emb|X62059|AL3ORIGA [14763] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, or 13 nucleotide neighbors ) 
X62058 

Bacteriophage alpha3 origin of cDNA synthesis (oriAA) 
gi| 14762!emb|X62058|AL3ORIAA [14762] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 13 nucleotide neighbors ) 
J02444 

Bacteriophage alpha3 origin of DNA replication 
gi| 1 66 1 03 |gb| J02444| AL30RI [ 1 66 103] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 12 nucleotide neighbors ) 
M25640 

Bacteriophage alpha-3 H protein gene, complete cds 
gi|166101|gb|M25640|AL3HP [166101] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 1 protein link, or 13 nucleotide neighbors ) 
M10631 

Bacteriophage alpha-3 cleavage site for phage phi-X174 gene A protein 
gi| 1 66099igb)M 1 063 1 1 AL3CSA [ 1 66099] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 
X00774 

Bacteriophage alpha-3 gene J sequence 
gill5431|emb|X00774|NCBA3J [15431] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 3 protein links, or 2 nucleotide neighbors ) 
M25640 

Bacteriophage alpha-3 H protein gene, complete cds 
gi|166101|gb|M25640|AL3HP [166101] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 13 nucleotide neighbors ) 
M10631 

Bacteriophage alpha-3 cleavage site for phage phi-X174 gene A protein 
gi|166099|gb|M106311AL3CSA [166099] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 
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J02459 

Bacteriophage lambda, complete genome 
gi|215104|gb|J02459|LAMCG [215104] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,87 MEDLINE links, 67 protein links, 190 nucleotide 
neighbors, or 1 genome link ) 

J02482 

Bacteriophage phi-X 1 74, complete genome 
gi|216019|gb|J02482|PXlCG [216019] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,23 MEDLINE links, 1 1 protein links, 26 nucleotide neiohbors 
or 1 genome link ) 

J02454 

Bacteriophage G4, complete genome 
gi|215415|gb|J02454|PG4CG [215415] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,6 MEDLINE links, 1 1 protein links, 20 nucleotide neighbors 
or 1 genome link ) * 

X60323 

Bacteriophage phiK complete genome 

gi| 14781 18|emb|X60323|BPHIKCG [14781 18] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,10 protein links, 18 nucleotide neighbors, or 1 genome link ) 
L42820 

Bacteriophage BF23 tail protein (hrs) gene, complete cds 
gi|1048680|gb|L42820iBBFHRS [1048680] 

(View GenBank report,FASTA report,ASN 1 report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
X54455 

Bacteriophage BF23 gene 17 and gene 18 
gi|14797iemb|X54455|BF231718G [14797] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 2 nucleotide neighbors ) 
M37097 

Bacteriophage BF23 DNA, right end of terminal repetition 
gi|166115|gb|M37097|BBFRIGH [166115] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
M37096 

Bacteriophage BF23 DNA, left end of terminal repetition 
gi|166114|gb|M37096|BBFLEFT [166114] 

(View GenBank report,FASTA report,ASKl report,Graphical view,l MEDLINE link; or 1 nucleotide neighbor ) 
M37095 

Bacteriophage BF23 A2-A3 gene, complete cds, and Al gene, 5* end 
gijl66110Igb|M370951BBFA2A3 [166110] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 3 protein links, or 1 nucleotide neighbor ) 
AF056281 

Bacteriophage BF23 clone bf23.mac5/6.1, genomic survey sequence 

gi|3090930|gb|AF056281|AF056281 [3090930] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
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AF056280 

Bacteriophage BF23 clone bf23.mac3, genomic survey sequence 

gi|3090929|gb|AF056280|AF056280 [3090929] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF056279 

Bacteriophage BF23 clone bf23.mac 18/2 134, genomic survey sequence 
gii3090928|gb|AF056279|AF056279 [3090928] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view)" 
AF056278 

Bacteriophage BF23 clone bf23. mac 16/19.33, genomic survey sequence 

gi|3090927|gb|AF056278|AF056278 [3090927] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

AF056277 

Bacteriophage BF23 clone bf23.mac 16/1 9-33, genomic survey sequence 

gi|3090926|gb|AF056277|AF056277 [3090926] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

Cl AF056276 

*!l Bacteriophage BF23 clone bf23 .mac 12/9-9, genomic survey sequence 
£ |: gi!3090925|gb|AF056276|AF056276 [3090925] 

||| (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

J AF056275 

: J: Bacteriophage BF23 clone bf23.macl 1/14-24, genomic survey sequence 

* I gi|3090924|gbiAF056275|AF056275 [3090924] 

" ^ (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

y* AF056274 

f|l Bacteriophage BF23 clone bf23.57r64r, genomic survey sequence 

0 gi|3090923|gb|AF056274|AF056274 [3090923] 

||| (View GenBank report,FASTA report, ASN. 1 report,Graphical view, or 3 nucleotide neighbors ) 

. PS- 

/*? AF056273 

^ Bacteriophage BF23 clone bf23.54fr, genomic survey sequence 
gi!3090922|gbiAF056273|AF056273 [3090922] 
(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF056272 

Bacteriophage BF23 clone bf23.47fr.macl0/7, genomic survey sequence 
gii3090921|gb|AF056272|AF056272 [3090921] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
AF056271 

Bacteriophage BF23 clone bf23.23.66r, genomic survey sequence 

gi|3090920|gb|AF056271|AF056271 [3090920] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

AF056270 

Bacteriophage BF23 clone bf23.23.64f, genomic survey sequence 

gi|3090919|gb|AF056270|AF056270 [3090919] 

(View GenBank report,FASTA report,ASN,l report, or Graphical view) 
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AF056269 

Bacteriophage BF23 clone bf23.23.60r, genomic survey sequence 
gi|30909 1 8|gb| AF056269|AF056269 [30909 1 8] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
AF056268 

Bacteriophage BF23 clone bf23.23.60f, genomic survey sequence 
gi!3090917|gb|AF056268|AF056268 [3090917] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or J nucleotide neighbor ) 
AF056267 

Bacteriophage BF23 clone bf23.23.59r, genomic survey sequence 
gi|3090916|gb|AF056267|AF056267 [3090916] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
AF056266 

Bacteriophage BF23 clone bf23.23.59f, genomic survey sequence 
gi|3090915|gb|AF056266|AF056266 [3090915] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
^ AF056265 

Bacteriophage BF23 clone bf23.23.56r, genomic survey sequence 
^ gi|3090914|gb|AF056265|AF056265 [3090914] 
+~ (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

Hp AF056264 

f J Bacteriophage BF23 clone bf23.23.56f, genomic survey sequence 
|J] gi|3090913|gb|AF056264|AF056264 [3090913] 

fjj (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
f., AF056263 

J~ Bacteriophage BF23 clone bf23.23.68f55r, genomic survey sequence 
J! gi!3090912|gb|AF056263|AF056263 [3090912] 

Cl (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
yg AF056262 

Bacteriophage BF23 clone bf23.23.43fr.66f, genomic survey sequence 

gi|3090911|gb|AF056262|AF056262 [3090911] 

(View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 

AF056261 

Bacteriophage BF23 clone bf23.23.2fr, genomic survey sequence 
gi|3090910|gb|AF056261|AF056261 [3090910] 

(View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
AF056260 

Bacteriophage BF23 clone bf23.23.55.f, genomic survey sequence 
gi|3090909|gb|AF056260|AF056260 [3090909] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
AF056259 

Bacteriophage BF23 clone bf23.23.53.r, genomic survey sequence 

gi|3090908|gb|AF056259|AF056259 [3090908] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
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AF056258 

Bacteriophage BF23 clone bf23.23.53.f, genomic survey sequence 
gil3090907|gb|AF056258|AF056258 [3090907] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
AF056257 

Bacteriophage BF23 clone bf23.23.52.r, genomic survey sequence 
gi!3090906|gb|AF056257|AF056257 [3090906] 

(View GenBank report,FASTA report,ASN.l report, or Graphical viewf 
AF056256 

Bacteriophage BF23 clone bf23.23.52.f, genomic survey sequence 

gi|3090905|gb|AF056256|AF056256 [3090905] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

AF056255 

Bacteriophage BF23 clone bf23.23.49.r, genomic survey sequence 

gi|3090904|gb|AF056255|AF056255 [3090904] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF056254 

Bacteriophage BF23 clone bf23.23.49.f, genomic survey sequence 

gi|3090903|gbjAF056254|AF056254 [3090903] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

AF056253 

Bacteriophage BF23 clone bf23.23.48.r, genomic survey sequence 

gii3090902|gb|AF056253|AF056253 [3090902] 

(View GenBank reportJFASTA report,ASN.l report, or Graphical view) 

AF056252 

Bacteriophage BF23 clone bf23.23.48.f, genomic survey sequence 

gi|3090901|gb|AF056252|AF056252 [3090901] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

AF056251 

Bacteriophage BF23 clone bf23.23.44.r, genomic survey sequence 

gi|3090900|gb|AF05625 1 |AF05625 1 [3090900] 

(View GenBank report,FASTA report^ASN.l report, or Graphical view) 

' AF056250 

Bacteriophage BF23 clone bf23.23.41.f, genomic survey sequence 

gi|3090899|gb|AF0562501AF056250 [3090899] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF056249 

Bacteriophage BF23 clone bf23.23.22.a.r, genomic survey sequence 

gi|3090898|gb|AF056249|AF056249 [3090898] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

AF056248 

Bacteriophage BF23 clone bf23.23.22.a.f, genomic survey sequence 
gij3090897|gb|AF056248|AF056248 [3090897] 

(View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
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AF056247 

Bacteriophage BF23 clone bf23.23.68.r, genomic survey sequence 

gi|3090896|gb|AF056247|AF056247 [3090896] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 

Z50114 

Bacteriophage BF23 DNA for putative tail protein gene 
gi|2464952|emb|Z501 14|BF23LATE [2464952] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, ox\ protein link ) 
D12824 

Bacteriophage BF23 genes for minor tail protein gp24 and major tail protein gp25, complete cds 
gi|520578|dbj|D12824|BBF2TAIL [520578] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 3 nucleotide neighbors ) 
Z34953 

Bacteriophage K3 ip9, ip7 and ip8 genes 
gi|535261|emb|Z34953|MYK3IP978 [535261] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
Z35075 

Bacteriophage K3 DNA for Ip3 and Ip4 
gi|535229|emb|Z35075|MYEORF64K [535229] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
X05560 

Bacteriophage K3 gene 38 for receptor recognizing protein 
gi|15112|emb|X05560|MYK3G38 [15112] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
X04747 

Bacteriophage K3 gene 37 for receptor recognizing protein 
gi|15110|emb|X04747|MYK3G37 [15110] 

(View GenBank report,FASTA report, ASN.l report,Graphical vicw,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
X01754 

Bacteriophage K3 tail fiber gene 36 
gi|15108|emb|X01754|MYK3F36 [15108] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
M16812 

Bacteriophage K3 't* lysis gene, complete cds 
gi|215503|gb|M16812|PK3LYST [215503] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
L46833 

Bacteriophage K3 frd3, frd2 genes, complete cds 
gi|951377|gb|L46833|PK3FRD32G [951377] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 2 nucleotide neighbors ) 
L43613 

Bacteriophage K3 fibritin (wac) gene, complete cds 
gi|903861|gb|L43613|PK3WAC [903861] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l protein link, or 4 nucleotide neighbors ) 
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X01753 

Bacteriophage 0x2 tail fiber gene 36 
gi|15122|emb|X01753|MYOX2F36 [15122] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
L43612 

Bacteriophage 0x2 fibritin (wac) gene, complete cds 
gi|903848|gb|L43612|OX2WAC [903848] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l protein link, or 4 nucleotide neighbors ) 

Z46880 

Bacteriophage 0X2 stp gene 
gi|599663|emb|Z46880|BPOX2STP [599663] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, 1 MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
X05675 

Bacteriophage 0x2 gene 38 for receptor-recognizing protein and flanking regions 
gi| 1 5 1 24|emb|X05675|MYOX2G3 8 [1 5 124] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
M33533 

Bacteriophage RBI 8 translational repressor protein (regA) and Orf43.1, complete cds 
gi|216083|gb|M33533|RB18REGA [216083] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
AF033329 

Bacteriophage RBI 8 single-stranded binding protein (gene 32) gene, partial cds, and 5' region 
gi|2645788[gb|AF033329|AF033329 [2645788] 

(View GenBank report,FASTA report,ASN.I report,Graphical view,l protein link, or 1 1 nucleotide neighbors ) 
M86231 

Bacteriophage RB69 gene 62, 3'end; RegA (regA) gene, complete cds 
gi|2 1 5354|gb|M8623 1 |P6962REGA [2 15354] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
AF033332 

Bacteriophage RB69 single-stranded binding protein (gene 32) gene, partial cds, and 5' region 
gi|2645794|gb|AF033332|AF033*332 [2645794] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 12 nucleotide neighbors ) 
U34036 

Bacteriophage RB69 DNA polymerase (43) gene, complete cds 
giil237125|gb|U34036|BRU34036 [1237125] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
V01 145 

Bacteriophage HI genome fragment Each Thymine given in this sequence represents a HMU-residue 
(HMU = 5-hydroxymethyluracil) 
gi|15557|emb|V01 145|PODOHl [15557] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
X05676 

Bacteriophage Ml gene 38 for receptor recognizing protein and flanking regions 
gi| 1 5 1 14|emb|X05676|MYM 1G38 [15 1 14] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
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AF034575 

Bacteriophage Ml putative integrase (int) gene, complete cds, and attP region, complete sequence 
gi|2662472|gb|AF034575|AF034575 [2662472] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
AF033321 

Bacteriophage Ml single-stranded binding protein (gene 32) gene, partial cds, and 5* region 
gi|2645772|gb|AF033321|AF033321 [2645772] 

(View GenBank report,FASTA report, ASN.l report,Graphical view.l protein link, or 17 nucleotide neighbors ) 
X55 190 

Bacteriophage Tula 37 and 38 genes for receptor-recognizing proteins 37 and 38 (respectively), partial cds 
gi| 14860|emb|X55 1 90|BPTUIA [ 1 4860] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
AF033334 

Bacteriophage Tulb single-stranded binding protein (gene 32) gene, partial cds, and 5' region 
gi|2645798|gb)AF033334|AF033334 [2645798] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 5 nucleotide neighbors ) 
X55191 

Bacteriophage Tulb 37 gene for receptor-recognizing protein 37 (partial cds), 38 gene for receptor-recognizing protein 38, 
and t gene (partial cds) 
gi|14863|emb|X55191|BPTUIB [14863] 

(View GenBank report 5 FASTA report, ASN.l report,Graphical view.l MEDLINE link, 3 protein links, or 3 nucleotide neighbors ) 
X13065 

Bacteriophage phi80 early region 
gi|14800|emb|X13065|BP80ER [14800] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 8 protein links, or 6 nucleotide neighbors ) 

D00360 
Bacteriophage phi80 cor gene 
gi|217782|dbj|D00360|P8080COR [217782] 

(View GenBank report,FASTA report, ASN.l report,GraphicaI view, or 1 protein link ) 
X01639 

Bacteriophage phi 80 DNA-fragment with replication origin 
gi|15828|emb|X01639jXXPHI80 [15828] 

(View GenBank report,FASTA report^SN.l report,Graphical view,l MEDLINE link, or 25 nucleotide neighbors ) 
X04051 

Lambdoid bacteriophage phi 80 int-xis region (integrase-excisionase region) 
gi| 15770|emblX0405 1|STPHI80X [15770] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
X06751 

Phage Phi80 DNA for major coat protein 
gi| 15768|emb|X0675 1|STPHI80C [15768] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 11 nucleotide neighbors ) 
X75949 

Bacteriophage phi80 DNA for ORF xl71.8 and ORF xl71.28* 
gi|458811|emb|X75949|ECORF171B [458811] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 2 protein links, or 28 nucleotide neighbors ) 
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L40418 

Bacteriophage phi-80 gene, complete cds 
gi|1019107|gbIL40418|P80A [1019107] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
M24831 

Bacteriophage phi-80 Tyr-tRNA gene, 3' end 
gi|215363|gb|M24831|P80TGY [215363] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 43 nucleotide neighbors ) 
Ml 0670 

Bacteriophage phi-80 replication origin 
gi!215361|gb|M10670|P80ORI [215361] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
M24825 

Bacteriophage phi-80 RNA fragment 
gi]215360|gb|M24825|P80M3A [215360] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
M11919 

Bacteriophage phi-80 cl immunity region encoding the N gene 
gi|215358|gb|M11919|P80CI [215358] 

(View GenBank report,FASTA report, ASN.l report, Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
M10891 

Bacteriophage phi-80 attP site DNA 
gi|2 1 535 7|gb|M 1 089 1 |P80ATTP [215357] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 nucleotide neighbor ) 
Ml 9473 

Bacteriophage 933J (from E.coli) proviral Shiga-like toxin type 1 subunits A and B genes, complete cds 
gi| 2 1 5 072|gb|M 1 9473 1 J93 SLTI [2 1 5072] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 2 protein links, or 20 nucleotide neighbors ) 
Y 10775 

Bacteriophage 93 3 W ileX, stx2A and stx2B genes 
gi|1938206|emb|Y10775|BP933ILEX [1938206] 

(View GenBank report,FASTA reporCASN.l report,Graphical view,2 protein links, or 36 nucleotide neighbors ) 
X83722 

Bacteriophage 933W slt-HB gene 
gi|1490229|emb|X83722|B933WSLT [1490229] 

(View GenBank report,FASTA report,ASN,l report,Graphical view,2 protein links, or 20 nucleotide neighbors ) 
X07865 

Bacteriophage 933W slt-II gene for Shiga-like toxin typell subunit A and B 
gi|14892|emb|X07865|BWSLTII [14892] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 29 nucleotide neighbors ) 
M16625 

Bacteriophage H19B (from E.coli) sltlA and sltlB genes encoding Shiga-like toxin I subunits A and B, complete cds 
gi|215043|gb|M16625|H19BSLT [215043] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 24 nucleotide neighbors ) 
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M17358 

Bacteriophage H19B shiga-like toxin- 1 (SLT-1) A and B subunit DNA, complete cds 
gi|215046|gb|M17358|H19BSLTA [215046] 

(View GenBank report,FASTA report,ASRl report,Graphical view,l MEDLINE link, 2 protein links, or 20 nucleotide neighbors ) 
U29728 

Bacteriophage N4 single-stranded DNA-binding protein (N4SSB) gene, complete cds 
gi|939708|gb|U29728|BNU29728 [939708] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,2 MEDLINE links, or 1 protein link ) 
J02580 

Bacteriophage PA-2 (E.coli porcine strain isolate) Rz gene, 5'end; ORF2, outer membrane porin protein (1c) and ORF1 genes, 
complete cds 

gi|2 1 5366|gb| J02580|PA2LC [2 1 5366] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 4 protein links, or 4 nucleotide neighbors ) 
U32222 

Bacteriophage 186, complete sequence 
gij3337249igb|U32222|BlU32222 [3337249] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,6 MEDLINE links, 46 protein links, or 5 nucleotide neighbors ) 
X51522 

Bacteriophage P4 complete DNA genome 
gi|450916|emb|X51522|MYP4CG [450916] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 MEDLINE links, 13 protein links, 6 nucleotide neighbors, 
or 1 genome link ) 

X92588 

Bacteriophage 82 orf33, orfl51, orf56, orf96, rus, orf45, and Q genes 
gi| 105 1 1 1 1 |emb|X92588|BAC82HOLL [105 1 1 1 1] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,7 protein links, or 1 nucleotide neighbor ) 
J02803 

Bacteriophage 82 antitermination protein (Q) gene, complete cds 
gij215364igbjJ02803iP82Q [215364] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINElink, or 1 protein link ) 
U02466 

Bacteriophage HK022 (cro), (ell) and (O) genes, complete cds, (P) gene, partial cds 
gi|407285|gb|U02466|BHU02466 [407285] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 5 protein links, or 1 nucleotide neighbor ) 
M26291 

Bacteriophage D108 regulatory DNA-binding protein (ner) gene, complete cds 
gii 1 66 1 94|gb|M2629 1 |D 1 8NER [ 1 66 1 94] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
Ml 1272 

Bacteriophage D108 left-end DNA 
gi|166193|gb(M11272|D18LEDNA [166193] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 nucleotide neighbors ) 
Ml 8902 

Bacteriophage D108 kil gene encoding a replication protein, 3* end; and containing three ORFs, complete cds 
gij 1 66 1 9 1 )gb|M 1 8902|D 1 8KIL [166191] 

(View GenBank report,FASTA report ASN.l report,Graphical view,! MEDLINE link, 1 protein link, or 3 nucleotide neighbors ) 



31 



192 



M10191 

Bacteriophage D108, left end with Mu A protein binding sites LI and L2 
gi|166190|gb|M10191|D18BSL [166190] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 5 nucleotide neighbors ) 
J02447 

bacteriophage dl08 gene a 5' end 
gi|166189|gb|J02447|D18AAA [166189] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
V00865 

Bacteriophage D108 fragment from genes A and ner (C-terminus of ner and N-terminus of A) 
gi|15437|emb|V00865|NCD108 [15437] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 2 protein links ) 
X01914 

Bacteriophage IKe gene for DNA binding protein 
gi|14957|emb|X01914|INIKEDBP [14957] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 

AF064539 
Bacteriophage N15, complete genome 
gi|3192683|gb|AF064539|AF064539 [3 192683] 

(View GenBank report,FASTA report.ASN.l report,Graphical view,2 MEDLINE links, 60 protein links, 26 nucleotide neighbors, 
or 1 genome link ) 

U02303 

Bacteriophage Ifl, complete genome 
gi|3676280|gb|U02303|B2U02303 [3676280] 

(View GenBank report,FASTA report, ASN.l report,Graphicai view, 10 protein links, or 1 genome link ) 
AF007792 

Bacteriophage Mu late morphogenetic region 
gi|355 1775|gb|AF007792|AF007792 [3551775] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 nucleotide neighbor ) 
U24159 

Bacteriophage HP1 strain HPlcl, complete genome 
gi|1046235igb|U24159|BHU24159 [1046235] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,6 MEDLINE links, 41 protein links, 8 nucleotide neighbors, 
or 1 genome link ) 

Z71579 

Bacteriophage S2 type A 5.6 kb DNA fragment 
gi|1679806|emb|Z71579|BPHSlADNA [1679806] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,3 MEDLINE links, 9 protein links, or 9 nucleotide neighbors ) 
X53238 

Klebsiella sp. bacteriophage Kl 1 gene 1 for RNA polymerase 
gi|14984|emb|X53238|KSKl 1RPO [14984] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
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X85010 

Bacteriophage A5 1 1 ply5 1 1 gene 
gi|853748|emb|X85010|BPA511PLY [853748] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
U29728 

Bacteriophage N4 single-stranded DNA-binding protein (N4SSB) gene, complete cds 
gi|939708|gb|U29728|BNU29728 [939708] 

(View GenBank report,FASTA report,ASN.l report,Graphicai view,2 MEDLINE links, or 1 protein link ) 
J02445 

bacteriophage bol 3'-terminal region rna 
gi| 1 66 1 5 2[gb| J02445 iBO 1 TR3 [166152] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 5 nucleotide neighbors ) 
L06183 

Bacteriophage L5 (from Leuconostoc oenos) genome 
gi|289353|gb|L06183|BL5GENM [289353] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 genome link ) 
AF074945 

Mycoplasma arthritidis bacteriophage MAV1, complete genome 
gi|35 i 1243|gb|AF074945|AF074945 [35 1 1243] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,15 protein links, 3 nucleotide neighbors, or 1 genome link ) 
L13696 

Bacteriophage L2 (from Mycoplasma), complete genome 
gi|289338|gb|L13696|BL2CG [289338] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,3 MEDLINE links, 14 protein links, or 1 genome link ) 
X80191 

Bacteriophage PP7 mRNA for maturation, coat, lysis and replicase proteins 
gi|517237|emblX80191|BPP7PR [517237] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, I MEDLINE link, 4 protein links, or 1 genome link ) 
M19377 

Bacteriophage Pf3 from Pseudomonas aeruginosa (New York strain), complete genome 
gi|2l5380|gb|M19377|PF3COMNY [215380] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 9 protein links, or 5 nucleotide neighbors ) 
M11912 

Bacteriophage Pf3 from Pseudomonas aeruginosa (Nijmegen strain), complete genome 
gi|215371|gb|M11912|PF3COMN [215371] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 9 protein links, 5 nucleotide neighbors, or 1 
genome link ) 

V00605 

Bacteriophage Pfl gene encoding DNA binding protein 
gi|14970|emb|V00605|INOPFl [14970] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l proteine link, or 1 nucleotide neighbor ) 
L05626 

Bacteriophage PR4 capsid protein (P6) gene, complete cds 
gi|2157351gb|L05626|PR4P6MAJA [215735] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
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D 13409 

Bacteriophage phiCTX (isolated from Pseudomonas aeruginosa) cosR, attP, int genes 
gii217776|dbj|D13409|BPHCOSR [217776] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 MEDLINE link, 3 protein links, or 3 nucleotide neighbors ) 
D 13408 

Bacteriophage phiCTX (isolated from Pseudomonas aeruginosa) cosL, ctx genes 
gi|217775|dbj|D13408|BPHCOSLCTX [217775] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,2 MEDLINE links, or 3 nucleotide neighbors ) 
M24832 

Bacteriophage f2 coat protein gene, partial cds 
gi|166228|gb|M24832|F2CRNACA [166228] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 4 nucleotide neighbors ) 
S72011 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes,partial cds 
gi|2618967|gb|AF017629|AF017629 [2618967] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 44 nucleotide neighbors ) 
AFO 17628 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618964|gb|AF017628|AF017628 [2618964] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 44 nucleotide neighbors ) 
AF017627 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|26 18961 |gb| AFO 1 7627| AFO 1 7627 [26 18961] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 44 nucleotide neighbors ) 
AF017626 

Bacteriophage 21 isocitrate dehydrogenase (icd) gene, partial cds; and integrase (int) gene, partial cds 
gi|2618958|gb|AF017626IAF017626 [2618958] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 49 nucleotide neighbors ) 
AF017625 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|26 1 895 5 |gb| AFO 1 7625| AFO 1 7625 [2618955] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 44 nucleotide neighbors ) 
■ AF017624 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int)genes, partial cds 
gi|2618952|gb|AF017624|AF017624 [2618952] 

(View GenBank report^FASTA report,ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 44 nucleotide neighbors ) 
AFO 17623 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618949|gb|AF017623|AF017623 [2618949] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017622 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618946|gb|AF017622|AF017622 [2618946] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
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AF01762I 

Bacteriophage 21 isocitxate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi[26 1 8943 |gb[ AFO 1 762 1 1 AFO 1 762 1 [2618943] 

(View GenBank report,FASTA report,ASN.l report, Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
D26449 

Bacteriophage PS 17 FI gene for tail sheath protein (gpFI) and FII gene for tail tube protein (gpFII), complete cds 
gi|452162|dbjiD26449|BPSFIFn [452162] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 2 protein links ) 
X87627 

Bacteriophage D31 12 A and B genes 
gi|974768iemb|X87627|BPD31 12AB [974768] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINElink, 2 protein links, or 1 nucleotide neighbor ) 
U32623 

Bacteriophage D3 transcriptional activator CII (ell) gene, complete cds 
gi|984852|gb|U32623|BDU32623 [984852] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 1 nucleotide neighbor ) 
L34781 

Bacteriophage phi 1 1 holin homologue (ORF3) gene, complete cds and peptidoglycan hydrolase (lytA) gene, partial cds 
gi|5 1 1838jgbiL34781|BPHHOLIN [51 1838] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 4 protein links, or 2 nucleotide neighbors ) 
L14810 

Bacteriophage P22 (gplO) gene, complete cds, and (gp26) gene, complete cds 
gi|294053|gb|L14810|P22GP1026X [294053] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
X87420 

Bacteriophage ESI 8 genes 24, c2, cro, cl, 18, and oL and oR operators 
gi|l 143407|emb|X87420|BPES18GEN [1143407] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,5 protein links, or 9 nucleotide neighbors ) 
L42820 

Bacteriophage BF23 tail protein (hrs) gene, complete cds 
gi|1048680|gb|L428201BBFHRS [1048680] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINElink, 1 protein link, or 1 nucleotide neighbor ) 
X14980 

Bacteriophage PRD1 XV gene for protein PI 5 (lytic enzyme) 
gi|15802|emb|X14980|TEPRD!XV [15802] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINElink, 1 protein link, or 4 nucleotide neighbors ) 
X06321 

Bacteriophage PRD1 gene 8 for DNA terminal protein 
gi|15800iemb|X06321|TEPRD18 [15800] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 10 nucleotide neighbors ) 
X14336 

Filamentous Bacteriophage 12-2 genome 
gi|14920|emb|X14336|INBI22 [14920] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 9 protein links, 1 nucleotide neighbor, or 1 
genome link ) 
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L05001 

Bacteriophage X glucosyl transferase gene, complete cds 
gi|216044|gb|L05001|PXFCLUSYLT [216044] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
M29479 

Bacteriophage p4 sid and psu genes partial cds, and delta gene, complete cds gi|215701| 
gb|M29479|PP4SDP [215701] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 protein links, or 4 nucleotide neighbors ) 

SEG_PP4PSUSID 

Bacteriophage P4 capsid size determination protein (sid) gene, 5' end 
gi|2 1 5698|gb||SEG_PP4PSUSID [2 1 5698] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 1 nucleotide neighbor ) 
M29650 

Bacteriophage P4 polarity suppression protein (psu) gene, complete cds 
gi|215697|gb|M29650|PP4PSUSID2 [215697] 

(View GenBank report,FASTA report, ASN.l report, or Graphical view) 
M29651 

C3 Bacteriophage P4 capsid size determination protein (sid) gene, 5' end 
$ gii215696|gb|M29651|PP4PSUSIDl [215696] 

4; (View GenBank report,FASTA report,ASN. 1 report, or Graphical view) 
J M27748 

JJs Bacteriophage P4 gop, beta, and ell genes, complete cds and int gene, 3' end 
j * gi|215691jgb|M27748|PP4GOPBC [215691] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 4 protein links, or 1 nucleotide neighbor ) 

s K02750 

h& Bacteriophage IKe, complete genome 

fll gi|215061|gb|K02750|IKECG [215061] 

CI (View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINElink, 10 protein links, 4 nucleotide neighbors, or 1 

f» I genome link ) 

% L40418 

^ Bacteriophage phi-80 gene, complete cds 
gi|1019107|gb|L404181P80A [1019107] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
AF032122 

Bacteriophage Sfll integrase (int) gene, partial cds; and bactoprenol glucosyl transferase (bgt), and glucosyl tranferase II (gtrll) 
genes,complete cds 

gi|2465412|gb|AF021347|AF021347 [2465412] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINElink, 4 protein links, or 2 nucleotide neighbors ) 
M35825 

Bacteriophage SF6 fragment D lysozyme gene, complete cds 
gi|216105|gb|M358251SF6LYZ [216105] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 protein link ) 

Z35479 
Bacteriophage CI 6 ipl gene 
gi|534936|emb|Z35479|BC16IPl [534936] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 2 nucleotide neighbors ) 
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X12638 

Bacteriophage 21 DNA for gene 2 
gi|296141|emb|X12638|B21GENE2 [296141] 

(View GenBaiik report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
X02501 

Bacteriophage 21 DNA for left end sequence with genes 1 and 2 
gi!15825|emb|X02501|XXPHA21 [15825] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l NiEDLINE link, 2 protein links, or 3 nucleotide neighbors ) 
M65239 

Bacteriophage 21 lysis genes S, R, and Rz, complete cds 
gi|215466|gb|M65239|PH2LYSGEN [215466] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 3 protein links, or 1 nucleotide neighbor ) 
M58702 

Bacteriophage 21 late gene regulatory region 
gi|215465|gb|M58702|PH2LATEGE [215465] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
M81255 

Bacteriophage 21 head gene operon 
gi|215454|gb|M81255|PH2HEADTL [215454] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 10 protein links, or 4 nucleotide neighbors ) 
M23775 

Bacteriophage 2 1 glycoprotein 1 gene, complete cds, and glycoprotein gene, 5' end 
gi|215451|gb|M23775|PH2GPA [215451] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 3 nucleotide neighbors ) 
M61865 

Bacteriophage 21 excisionase (xis), integrase (int) and isocitrate dehydrogenase (icd), complete cds 
gi|215448|gb|M61865|PH22XISAA [215448] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 9 nucleotide neighbors ) 
S72011 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618967|gb|AF017629|AF017629 [2618967] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017628 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618964!gb|AF017628|AF017628 [2618964] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017627 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gil26 1 896 1 |gbl AFO 1 7627| AFO 1 7627 [2618961] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017626 

Bacteriophage 21 isocitrate dehydrogenase (icd) gene, partial cds; and integrase (int) gene, partial cds 
gi|26 1 895 8|gb| AFO 1 7626| AFO 1 7626 [2618958] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 49 nucleotide neighbors ) 
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AFO 17625 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gii2618955|gb|AF017625|AF017625 [2618955] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017624 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618952|gb|AF017624|AF017624 [2618952] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AFO 17623 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618949|gb|AF017623|AF017623 [2618949] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017622 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618946|gb|AF017622|AF017622 [2618946] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
AF017621 

Bacteriophage 21 isocitrate dehydrogenase (icd) and integrase (int) genes, partial cds 
gi|2618943|gb|AF017621|AF017621 [2618943] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 44 nucleotide neighbors ) 
M57455 

Bacteriophage 42D (clone pDB17) (from Staphylococcus aureus) staphylokinase gene, complete cds 
gi|215344|gb|M57455|P42STK [215344] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view, 1 protein link, or 9 nucleotide neighbors ) 
Y12633 

Bacteriophage 85 DNA, promoter sequence of unknown gene 

gi|2058285|emblY12633|B85PROM [2058285] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 

X98146 

Bacteriophage PI DNA sequence around the Op88 operator 
gill359513|emb|X98146|BP10P880P [1359513] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 nucleotide neighbor ) 
Y07739 

Staphylococcus phage Twort holTW, plyTW genes 
gi|2764979|emblY07739|BPTWGHOLG [2764979] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 2 protein links ) 
L07580 

Bacteriophage phi-1 1 rinA and rin B genes, required for the activation of Staphylococcal phage phi-1 1 int expression 
gi|166160|gb|L07580|BPHRINAB [166160] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 2 protein links) 
M34832 

Bacteriophage phi-1 1 integrase (int) and excisionase (xis) genes, complete cds 
gi|166157|gb|M34832iBPHINTXIS [166157] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 2 protein links, or 2 nucleotide neighbors ) 
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M20394 

Bacteriophage phi-1 1 S.aureus attachment site (attP) 
gi|166156|gb|M20394|BPHATTP [166156] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 4 nucleotide neighbors ) 
X23128 

Bacteriophage phi- 13 integrase gene 
gi|758228|embiX82312iPHI13INT [758228] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, 1 protein link, or 3 nucleotide neighbors ) 
X61719 

S.aureus phi-13 lysogen right chromoson^acteriophage DNA junction 
gi|46625|emb|X61719|SAP13RJNC [46625] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 1 MEDLINE link ) 
X61718 

S.aureus phi-13 lysogen left chromosomal/bacteriophage DNA junction 
gil46624|emb|X61718|SAP13LJNC [46624] 

(View GenBank report,FASTA report,ASN.l report, Graphical view, or 1 MEDLINE link ) 
X61717 

Bacteriophage phi-13 core sequence for attachment 
gi| 1 4799iemb|X6 1 7 1 7|BP 1 3 ATTP [14799] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,2 MEDLINE links, or 3 nucleotide neighbors ) 
U01875 

Bacteriophage phi-13 putative regulatatory region and integrase (int) gene, partial cds 
gi|437118|gb|U01875|U01875 [437118] 

(View GenBank report,FASTA report,ASN. 1 report,Graphical view,3 MEDLINE links, or 4 nucleotide neighbors ) 
X67739 

S.aureus Bacteriophage phi-42 attP gene 
gi|14809|emb|X67739|BPATTPA [14809] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 3 nucleotide neighbors ) 
U01872 

Bacteriophage phi-42 integrase (int) gene, complete cds 
gi|4371 15|gb|U01872|U01872 [4371 15] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 MEDLINE links, 2 protein links, or 3 nucleotide neighbors ) 
X94423 

Staphylococcus aureus bacteriophage phi-42 DNA with ORFs (restriction modification system) 
giI1771597|emb|X94423|SARMS [1771597] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 1 nucleotide neighbor ) 
M27965 

Bacteriophage L54a (from S.aureus) int and xis genes, complete cds 
gi|215096|gb|M27965|L54INTXIS [215096] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, MEDLINE 1 link, 2 protein links, or 3 nucleotide neighbors ) 
U72397 

Bacteriophage 80 alpha holin and amidase genes, complete cds 
gi|1763241|gbiU72397|B8U72397 [1763241] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 protein links, or 2 nucleotide neighbors ) 
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AB009866 

Bacteriophage phi PVL proviral DNA, complete sequence 
gi|3341907|dbj|AB009866|AB009866 [3341907] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, 63 protein links, or 1 nucleotide neighbor ) 
Z47794 . 

Bacteriophage Cp-1 DNA, complete genome 
gi|2288892|emb|Z47794|BPCPlXX [2288892] 

(View GenBank report,FASTA report,ASN,l report,Graphicai view,3 MEDLINfi links, 28 protein links, 1 nucleotide neighbor, or 
1 genome link ) 

SEG_CP7RSIT 

Bacteriophage Cp-7 (S. pneumoniae) 5* inverted terminal repeat 
gi|166186|gb||SEG_CP7RSIT [166186] 

(View GenBank report,FASTA report,ASN.l report, Graphical view, or 1 MEDLINE link ) 
Ml 1635 

Bacteriophage Cp-7 (S.pneumoniae) DNA, 3' inverted terminal repeat 
gi|166185|gb|Ml 1635|CP7RSIT2 [166185] 

(View GenBank report,FASTA report,ASN.l report, or Graphical view) 
Ml#36 

Bacteriophage Cp-7 (S.pneumoniae) 5' inverted tenninal repeat 
jft66184|gb|Ml 1636|CP7RSIT1 [166184] 

(Slew GenBank report,FASTA report,ASN.l report, or Graphical view) 

SEgj_CP5RSIT 

Bacteriophage Cp-5 (S.pneumoniae), 5' inverted terminal repeat 
|ft66181|gb||SEG_CP5RSIT [166181] 

(View GenBank report,FASTA report, ASN.l report,Graphical view, or 1 MEDLINE link ) 
MflS33 

Bacteriophage Cp-5 (S.pneumoniae) 3* inverted terrninal repeat 
||166180|gb|M116331CP5RSIT2 [166180] 

Qgiew GenBank report,FASTA report, ASN.l report, or Graphical view) 
Ml%34 

Bacteriophage Cp-5 (S.pneumoniae), 5* inverted terminal repeat 
giil66179|gb|M11634|CP5RSITl [166179] 

(View GenBank report,FASTA reportsASN.l report, or Graphical view) 
M34780 

Bacteriophage Cp-9 muramidase (cpl9) gene 
gi|166187|gb|M34780|CP9CPL [166187] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 1 protein link, or 1 nucleotide neighbor ) 
M34652 

Bacteriophage HB-3 amidase (hbl) gene, complete cds 
gi|215055|gbIM34652|HB3HBLA [215055] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
U64984 

Streptococcus pyogenes phage T12 repressor, excisionase (xis), integrase(int) and erythrogenic toxin A precursor (speA) genes, 
complete cds giil877426|gb|U40453|SPU40453 [1877426] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE links, 4 protein links, or 22 nucleotide neighbors ) 
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X12375 

Phage CP-T1 (Vibrio cholerae) DNA for packaging signal (pac site) 
giil5435|emb|X12375|NCCPPAC [15435] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,l MEDLINE link, or 1 protein link ) 
AF087814 

Vibrio cholerae filamentous bacteriophage fs-2 DNA, complete genome sequence 
gi|3702207|dbj|AB002632|AB002632 [3702207] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 9 protein links, or 1 genome link ) 
D83518 

Bacteriophage KVP40 gene for major capsid protein precursor, complete cds 
gi|3046858|dbj|D83518|D83518 [3046858] 

(View GenBank reportJFASTA report, ASN.l report,Graphical view, 1 MEDLINE link, or 1 protein link ) 
AF033322 

Bacteriophage PST single-stranded binding protein (gene 32) gene, partial cds, and 5* region 
gi|2645774|gb|AF033322|AF033322 [2645774] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l protein link, or 17 nucleotide neighbors ) 
m X94331 

tg\ Bacteriophage L cro, 24, c2, and cl genes 

J gi| 14692 13|emb|X94331|BLCR024C [1469213] 

Jfjj (View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, or 4 protein links ) 
J U82619 

? Shigella flexneri bacteriophage V glucosyl transferase (gtr), integrase (int) and excisionase (xis) genes, complete eds 

yi gi|2465470|gb|U82619|SFU82619 [2465470] 

Til (View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE link, 8 protein links, or 1 nucleotide neighbor ) 

!■: y 
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Table 12 



NCBI Entrez Nucleotide QUERY 
Key words: bacteriophage and lysis 
56 citations found (all selected) 



AJ011581 

Bacteriophage PS 119 lysis genes 13, 19, 15, and packaging gene 3, 
complete cds 

gif3676084iemblAJ011581IBPS011581 [3676084] 

(View GenBank report,FASTA report>ASN.l report,Graphical view,4 protein 
links, or 1 nucleotide neighbor ) 

AJ011580 

Bacteriophage PS34 lysis genes 13, 19, 15, anti terminator gene 23, and 
packaging gene 3, complete cds 
gil3676078lembiAJ011580IBPS011580 [3676078] 

(View GenBank report,FASTA report ASN.l report,Graphical view,5 protein 
links, or 2 nucleotide neighbors ) 



AJ011579 

Bacteriophage PS3 lysis genes 13, 19, 15, and packaging gene 3 
gil3676073lemblAJ011579IBPS011579 [3676073] 

(View GenBank report,FASTA report .ASN.l report,Graphical view,4 protein 
links, or 1 nucleotide neighbor ) 



AF034975 

Bacteriophage H-19B essential recombination function protein (erf), kil 
protein (kil), regulatory protein cIII (cIII), protein gp!7 (17), N 
protein (N), cl protein (cl), cro protein (cro), ell protein (ell), O 
protein (O), P protein (P), ren protein (ren), Roi (roi), Q protein (Q), 
Shiga-like toxin A (slt-IA) and B (slt-IB) subunits, and putative holin 
protein (S) genes, complete cds 
gil2668751lgblAF034975l [2668751] 

(View GenBank report,FASTA reportASN.l report,Graphical view.l MEDLINE 
link, 20 protein links, or 30 nucleotide neighbors ) 



U37314 

Bacateriophage lambda Rzl protein precursor (Rzl) gene, complete cds 
gill017780lgblU37314IBLU37314 [1017780] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view,2 MEDLINE 
links, 1 protein link, or 9 nucleotide neighbors ) 



U00005 

E. coli hflA locus encoding the hflX, hflK and hflC genes, hfq gene, 
complete cds; miaA gene, partial cds 
gi!436153IgbiU00005IECOHFLA [436153] 

(View GenBank reportJFASTA reportASN.l report,Graphical view,4 MEDLINE 
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links, 5 protein links, or 8 nucleotide neighbors ) 



U32222 

Bacteriophage 186, complete sequence 
gil3337249IgblU32222IB 1U32222 [3337249] 

(View GenBank report,FASTA report ASN.l report,Graphical view,6 MEDLINE 
links, 46 protein links, or 5 nucleotide neighbors ) 



AF064539 

Bacteriophage N15, complete genome 
gii3192683lgblAF0645391AF064539 [3192683] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,2 MEDLINE 
links, 60 protein links, 26 nucleotide neighbors, or 1 genome link ) 



AF063097 

Bacteriophage P2, complete genome 
gil3139086lgb!AF063097IAF063097 [3139086] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,21 MEDLINE 
links, 42 protein links, 3 nucleotide neighbors, or 1 genome link ) 



Z97974 

Bacteriophage phiadh lys, hoi, intG, rad,and tec genes 
gi!2707950lemblZ97974!BPHIADH [2707950] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE 
links, 9 protein links, or 1 nucleotide neighbor ) 



AF059243 

Bacteriophage NL95, complete genome 
gil3088545igb!AF059243IAFO59243 [3088545] 

(View GenBank report,FASTA report.ASN.1 report,Graphical view,2 MEDLINE 
links, 4 protein links, 3 nucleotide neighbors, or 1 genome link ) 



AF052431 

Bacteriophage Mil A-protein, coat protein, A 1-protein, and replicase 
genes, complete cds 
gi!2981208tgblAF052431l [2981208] 

(View GenBank report,FASTA report^SN.l report,Graphical view,2 MEDLINE 
links, 4 protein links, or 8 nucleotide neighbors ) 



Y07739 

Staphylococcus phage Twort holTW, plyTW genes 
gii2764979lemblY07739IBPTWGHOLG [2764979] 
(View GenBank report,FASTA report ASN.l report,Graphical view, or 2 
protein links ) 



X94331 
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Bacteriophage L cro, 24, c2, and cl genes 

gii 14692 13lembtX94331IBLCR024C [1469213] 

(View GenBank report,FASTA reporCASN.l report,Graphical view,l MEDLINE 
link, or 4 protein links ) 



X78410 

Bacteriophage phiadh holin and lysin genes 
gil793848lemblX784101LGHOLLYS [793848] 

(View GenBank report,FASTA report,ASN.l report,Graphicalyiew 9 l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 



X99260 

Bacteriophage B103 genomic sequence 
gill429229!emb!X99260IBB103G [1429229] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view,l MEDLINE 
link, 17 protein links, or 12 nucleotide neighbors ) 



AJ000741 

Bacteriophage PI darA operon 
gii2462938lemWAJ000741IBPAJ7641 [2462938] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view,l MEDLINE 
link, 10 protein links, or 3 1 nucleotide neighbors ) 



X87420 

Bacteriophage ES18 genes 24, c2, cro, cl, 18, and oL and oR operators 
gi!1143407lembtX87420IBPES18GEN [1143407] 

(View GenBank reportJFASTA report ,ASN.l report,Graphical view,5 protein 
links, or 9 nucleotide neighbors ) 



L35561 

Bacteriophage phi-105 ORFs 1-3 
gil532218[gblL35561iPH50RFBrTR [532218] 

(View GenBank report,FASTA report.ASN.l report,Graphical view,l MEDLINE 
link, or 3 protein links ) 



D10027 

Group II RNA coliphage GA genome 
gil2177841dbjlD10027!PGAXX [217784] 

(View GenBank report,FASTA reportASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, 5 nucleotide neighbors, or 1 genome link ) 



V01 128 

Bacteriophage phi-X174 (cs70 mutation) complete genome 
gill5535iemblV01128IPHIX174 [15535] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,4 MEDLINE 
links, 1 1 protein links, or 26 nucleotide neighbors ) 
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S81763 

coat gene...replicase gene [bacteriophage KU1, host=Escherichia coli, 
group II RNA phage, Genomic RNA, 3 genes, 120 nt] 
gill438766lgblS81763IS81763 [1438766] 

(View GenBank report,FASTA report .ASN.l report,Graphical view, or 1 
MEDLINE link) 



U38906 

Bacteriophage rlt integrase, repressor protein (rro), dUTPase, holin and 

lysin genes, complete cds 

gi! 13535171gblU389061BRU38906 [1353517] 

(View GenBank report,FASTA report >ASN.l report,Graphical view,2 MEDLINE 
links, 50 protein links, or 3 nucleotide neighbors ) 



X91149 

Bacteriophage phi-C31 DNA cos region 
gilll07473lemblX91149iAPHIC31C [1107473] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, 6 protein links, or 1 nucleotide neighbor ) 



V00642 

phage MS2 genome 
g ill5081lemblV00642iLEMS2X [15081] 

(View GenBank report,FASTA report^ASN.! report,Graphical view,8 MEDLINE 
links, 4 protein links, or 20 nucleotide neighbors ) 



V01 146 

Genome of bacteriophage T7 
gil431187lemblV0H46iT7CG [431187] 

(View GenBank report,FASTA reportASN.l report,Graphical view, 13 MEDLINE 
links, 60 protein links, 105 nucleotide neighbors, or 1 genome link ) 



X78401 

Bacteriophage P22 right operon, orf 48, replication genes 18 and 12, nin 
region genes, ninG phosphatase, late control gene 23, orf 60, complete 
cds, late control region, start of lysis gene 13 
gi!512343lembiX78401iPOP22NIN [512343] 

(View GenBank report,FASTA repartASN.l report,Graphical view,2 MEDLINE 
links, 13 protein links, or 4 nucleotide neighbors ) 



Y00408 

Bacteriophage T4 gene t for lysis protein 
gi!15368lemblY00408IMYT4T [15368] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, 1 protein link, or 3 nucleotide neighbors ) 



Z26590 
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Bacteriophage mv4 lysA and lysB genes 
gi!410500lemblZ26590IMV4LYSAB [410500] 

(View GenBank report,FASTA report,ASN.l report,Graphical view, or 4 
protein links ) 



X07809 

Phage phiX174 lysis (E) gene upstream region 
gi!150941emblX078091MIPHIXE [15094] 

(View GenBank report,FASTA report,ASNl report,Graphical view.l MEDLINE 
link, 2 protein links, or 4 nucleotide neighbors ) 



Z34528 

Lactococcal bacteriophage c2 lysin gene 
gil506455lemblZ34528lLBC2LYSIN [506455] 

(View GenBank report,FAST A report ASN.l report,Graphical view,l MEDLINE 
link, 1 protein link, or 4 nucleotide neighbors ) 



X15031 

Bacteriophage fr RNA genome 
gill5071lembiX150311LEBFRX [15071] 

(View GenBank report,FAST A reportASN.l report,Graphical view,l MEDLINE 
link, 4 protein links, 9 nucleotide neighbors, or 1 genome link ) 



X80191 

Bacteriophage PP7 mRNA for maturation, coat, lysis and replicase 
proteins 

g il517237iemblX80191iBPP7PR [517237] 

(View GenBank report,FASTA report^ASRl report,Graphical view,l MEDLINE 
link, 4 protein links, or 1 genome link ) 

X85010 

Bacteriophage A511 plySll gene 
giI853748!emblX85010IBPA511PLY [853748] 

(View GenBank report JASTA report,ASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 



X85009 

Bacteriophage A500 holSOO and ply500 genes 
giI853744!emblX85009IBPA500PLY [853744] 

(View GenBank report,FAST A report ASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, or 4 nucleotide neighbors ) 



X85008 

Bacteriophage A 1 18 holl 18 and ply 1 18 genes 
gi!853740lemb!X85008IBPA118PLY [853740] 

(View GenBank report,FASTA reporiASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 
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Z35638 

Bacteriophage phi-X174 genes for lysis protein and beta-lactamase 
gil520996lemblZ35638IBPLYSPR [520996] 

(View GenBank reportJFASTA report ASN.l report,Graphical view.l MEDLINE 
link, 2 protein links, or 516 nucleotide neighbors ) 



J02459 

Bacteriophage lambda, complete genome 
gil215104lgbIJ02459ILAMCG [215104] 

(View GenBank report,FASTA report, ASN.l report,Graphical view,87 MEDLINE 
links, 67 protein links, 190 nucleotide neighbors, or 1 genome link ) 



X87674 

Bacteriophage PI lydA & lydB genes 
gil9747631emb!X87674!BACPlLYD [974763] 

(View GenBank report,FASTA report ASN.l report,Graphical view, 1 MEDLINE 
link, 2 protein links, or 2 nucleotide neighbors ) 



X87673 

Bacteriophage PI gene 17 
gi!974761lemb!X87673IBACP117 [974761] 

(View GenBank reportJFASTA report,ASN.l report.Graphical view,l MEDLINE 
link, 1 protein link* or 1 nucleotide neighbor ) 



M14784 

Bacteriophage T3 strain amNG220B right end, tail fiber protein, lysis 
protein and DNA packaging proteins, complete cds 
gi!215810lgblM147841Pr3RE [215810] 

(View GenBank reportJFASTA reporUASN.l report,Graphical view,l MEDLINE 
link, 9 protein links, or 10 nucleotide neighbors ) 



Ml 1813 

Bacteriophage PZA (from B.subtilis), complete genome 
gi!216046igblMl 1813IPZACG [216046] 

(View GenBank reportJFASTA report ASN.l report,Graphical view3 MEDLINE 
links, 27 protein links, 17 nucleotide neighbors, or 1 genome link ) 



M16812 

Bacteriophage K3 V lysis gene, complete cds 
gi!215503lgbiM16812IPK3LYST [215503] 

(View GenBank reportJFASTA reporUASN.l report,Graphical view,l MEDLINE 
link, 1 protein link, or 4 nucleotide neighbors ) 



J04356 

Bacteriophage P22 proteins 15 (complete cds), and 19 (3 r end) genes 
gil215265igb!J04356!P2215P [215265] 
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(View GenBank report,FASTA reporUASN.l report,Graphical view.l MEDLINE 
link, 3 protein links, or 2 nucleotide neighbors ) 



J04343 

Bacteriophage JP34 coat and lysis protein genes, complete cds, and 
replicase protein gene, 5' end 
gil215076lgblJ04343IJP3COLY [215076] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, or 2 nucleotide neighbors ) 



J02482 

Bacteriophage phi-X174, complete genome 
gil216019igblJ02482iPXlCG [216019] 

(View GenBank report.FASTA report,ASN.l report,Graphical view,23 MEDLINE 
links, 1 1 protein links, 26 nucleotide neighbors, or 1 genome link ) 



M99441 

Bacteriophage T4 anti-sigma 70 protein (asiA) gene, complete cds and 
lysis protein, 3* end 

gil215820lgbIM99441iPT4ASIA [215820] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,3 MEDLINE 
links, 2 protein links, or 2 nucleotide neighbors ) 



M65239 

Bacteriophage 21 lysis genes S, R, and Rz, complete cds 
gil215466lgblM65239!PH2LYSGEN [215466] 

(View GenBank report,FASTA report^ASN.l report,Graphical view,l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 



M10637 

Phage G4 D/E overlapping gene system, encoding D (morphogenetic) and E 
(lysis) proteins 

gil215427lgb!M10637IPG4DE [215427] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE 
link, 2 protein links, or 12 nucleotide neighbors ) 



J02454 

Bacteriophage G4, complete genome 
gil215415lgblJ02454IPG4CG [215415] 

(View GenBank report,FASTA report ASN.l report,Graphical view,6 MEDLINE 
links, 1 1 protein links, 20 nucleotide neighbors, or 1 genome link ) 



J02580 

Bacteriophage PA-2 (E.coli porcine strain isolate) Rz gene, 5'end; ORF2, 
outer membrane porin protein Qc) and ORF1 genes, complete cds 
gil215366IgbiJ02580IPA2LC [215366] 

(View GenBank report,FASTA reporCASN.l report,Graphical view.l MEDLINE 
link, 4 protein links, or 4 nucleotide neighbors ) 
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M14782 

Bacillus phage phi-29 head morphogenesis, major head protein, head fiber 

protein, tail protein, upper collar protein, lower collar protein, 

pre-neck appendage protein, morphogenesis(13), lysis, morphogenesis(15), 

encapsidation genes, complete cds 

g il215323lgblM14782iP29LATE2 [215323] 

(View GenBankreport,FASTA report^SN.l report,Graphical view,l MEDLINE 
link, 1 1 protein links, or 1 1 nucleotide neighbors ) 



M10997 



Bacteriophage P22 lysis genes 13 and 19, complete cds 
gil215262!gbiM109971P221319 [215262] . 
(View GenBank report,FASTA reportASN.l report,Graphical view,l MEDLINE 
link, 2 protein links, or 3 nucleotide neighbors ) 



J02467 

Bacteriophage MS2, complete genome 

gil215232lgblJ024671MS2CG [215232] . o.^txtt, 

(View GenBank report,FASTA report^SN.l report,Graphical view,8 MEDLINE 
links, 4 protein links, 20 nucleotide neighbors, or 1 genome link ) 



M14035 

Bacteriophage lambda lysis S gene with mutations leading to nonlethality 
of S in the plasmid pRGl 
gi!215180!gblM14035ILAMLYS [215180] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view,l MEDLINE 
link, 1 protein link, or 14 nucleotide neighbors ) 



U04309 



Bacteriophage phi-LC3 putative holin (lysA) gene and putative murein 
hydrolase (lysB) gene, complete cds 
gii530796IgbiU04309IBPU04309 [530796] 

(View GenBank report,FASTA report^SN.l report,Graphical view.l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 
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Table 13 



NCBI Entrez Nucleotide QUERY 

Key word: holin 

51 citations found (all selected) 



AF034975 

Bacteriophage H-19B essential recombination function protein (erf), kil 
protein (kil), regulatory protein cIII (cIII), protein gpl7 (17), N 
protein (N), cl protein (cl), cro protein (cro), ell protein (ell), 0 
protein (O), P protein (P), ren protein (ren), Roi (roi), Q protein (Q), 
Shiga-like toxin A (slt-IA) and B (slt-IB) subunits, and putative holin 
protein (S) genes, complete cds 
gil2668751lgb!AF034975! [2668751] 

(View GenBank report,FASTA report ASN.l report.Graphical view.l MEDLINE 
link, 20 protein links, or 30 nucleotide neighbors ) 

U52961 

Staphylococcus aureus holin-like protein LrgA (lrgA) and LrgB (lrgB) 
genes, complete cds 

gill841516lgblU52961ISAU52961 [1841516] 

(View GenBank report,FASTA report ASN.l report.Graphical view.l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 



U28154 

Haemophilus somnus cryptic prophage genes, capsid scaffolding protein 
gene, partial cds, major capsid protein precursor, endonuclease, capsid 
completion protein, tail synthesis proteins, holin, and lysozyme genes, 
complete cds 

gill765928lgblU28154IHSU28154 [1765928] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,l MEDLINE 
link, or 13 protein links ) 



AF032122 

Streptococcus thermophilus bacteriophage Sfil9 central region of genome 
gil2935682lgblAF032122l [2935682] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view.l MEDLINE 
link, 14 protein links, or 2 nucleotide neighbors ) 



AF032121 

Streptococcus thermophilus bacteriophage Sfi21 central region of genome 
gil2935667lgblAF032121IAF032121 [2935667] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, 14 protein links, or 2 nucleotide neighbors ) 
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AF021803 

Bacillus subtilis 168 prophage SPbeta N-acetylmuramoyl-L-alanine amidase 
(blyA), holin-like protein (bhlA), holin-like protein (bhlB), and yolK 
genes, complete cds; and yolJ gene, partial cds 
gil2997594lgblAF021803IAF0218(B [2997594] 

(View GenBank report,FASTA report ASN.l report,Graphical view.l MEDLINE 
link, 5 protein links, or 1 nucleotide neighbor ) 

AF057033 

Streptococcus thermophilus bacteriophage sfil 1 gp502 (orf502), gp284 
(orf284), gpl29 (orf 129), gpl93 (orf 193), gpl 19 (orf 1 19), gp348 
(orB48), gp53 (orf53), gpl 13 (orf 113), gpl04 (orf 104), gpl 14 (orf 114), 
gpl28 (orf 128), gpl68 (orf 168), gpl 17 (orf 1 17), gpl05 (orf 105), putative 
minor tail protein (orf 15 10), putative minor structural protein 
(orf512), putative minor structural protein (orf 1000), gp373 (orf373), 
gp57 (orf57), putative anti-receptor (orf695), putative minor structural 
protein (orf669), gpl49 (orf 149), putative holin (orf 141), putative 
holin (orf87), and lysin (orf288) genes, complete cds 
gil3320432lgblAF057033IAF057033 [3320432] 

(View GenBank report,FASTA report,ASN.l report.Graphical view,25 protein 
links, or 1 nucleotide neighbor ) 

U32222 

Bacteriophage 186, complete sequence 

g il3337249lgblU32222iB 1U32222 [3337249] t 

(View GenBank report,FASTA report ASN.l report,Graphical view,6 MEDLINE 

links, 46 protein links, or 5 nucleotide neighbors ) 

AB009866 

Bacteriophage phi PVL proviral DNA, complete sequence 
gi!3341907ldbjlAB009866IAB009866 [3341907] 

(View GenBank report,FASTA report ASN.l report,Graphical view,63 protein 
links, or 1 nucleotide neighbor ) 



AF009630 

Bacteriophage WL170, complete genome 
gil3282260lgblAF009630IAF009630 [3282260] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,63 protein 
links, 3 nucleotide neighbors, or 1 genome link ) 



AF064539 

Bacteriophage N15, complete genome 
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gil3192683lgblAF064539IAF064539 [3192683] 

(View GenBank report,FASTA report ASN.l report,Graphical view,2 MEDLINE 
links, 60 protein links, 26 nucleotide neighbors, or 1 genome link ) 



AF063097 

Bacteriophage P2, complete genome 

gii3 139086lgblAF063097IAF063097 [3 139086] 

(View GenBank report,FASTA report ASN.l report,Graphical view,21 MEDLINE 
links, 42 protein links, 3 nucleotide neighbors, or 1 genome link ) 



Z97974 

Bacteriophage phiadh lys, hoi, intG, rad,and tec genes 
gil2707950lemblZ97974IBPHIADH [2707950] 

(View GenBank report,FASTA report ASN.l report,Graphical view,2 MEDLINE 
links, 9 protein links, or 1 nucleotide neighbor) 



X95646 

Streptococcus thermophilus bacteriophage Sfi21 DNA; lysogeny module, 
8141 bp 

gil2292747iemblX95646!BSFI21LYS [2292747] 

(View GenBank reportJFASTA report,ASN.l report,Graphical view,2 MEDLINE 
links, 19 protein links, or 3 nucleotide neighbors ) 



SEG_LLHLYSIN0 

Bacteriophage LL-H structural protein gene, partial cds; minor 
structural protein gp61 (g57), unknown protein, unknown protein, 
structural protein (g20), unknown protein, unknown protein, major capsid 
protein (g34), main tail protein gpl9 (gl7), holin (hoi), muramidase 
(mur), unknown protein, unknown protein, unknown protein, unknown 
protein, unknown protein, and unknown protein genes, complete cds; 
unknown protein gene, partial cds; and unknown protein, unknown protein, 
unknown protein, unknown protein, unknown protein, minor structural 
protein gp75 (g70), minor structural protein gf&9 (g88), minor 
structural protein gp58 (g71), unknown protein, unknown protein, unknown 
protein, and unknown protein genes, complete cds 
gill004337lgbliSEGJXHLYSIN0 [1004337] 

(View GenBank report,FASTA report >ASN.l report,Graphical view,4 MEDLINE 
links, 31 protein links, or 1 nucleotide neighbor) 



M96254 

Bacteriophage LL-H holin (hoi), muramidase (mur), and unknown protein 
genes, complete cds 

giil004336!gb!M%254ILLHLYSIN03 [1004336] 

(View GenBank reportJFASTA report ASN.l report, or Graphical view) 
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Y07740 

Staphylococcus phage 187 ply 187 and hoi 187 genes 
gil2764982lemblY07740IBP187PLYH [2764982] 
(View GenBank reportJFASTA report ASN.l report.Graphical view, or 2 
protein links ) 



U88974 

Streptococcus thermophilus bacteriophage 01205 DNA sequence 
gil2444080lgblU88974l [2444080] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE 
link, 57 protein links, or 6 nucleotide neighbors ) 



Z99117 

Bacillus subtilis complete genome (section 14 of 21): from 2599451 to 
2812870 

gil2634966lemblZ99117IBSUB0014 [2634966] 

(View GenBank report.FASTA report ASN.l report,Graphical view,233 

protein links, 51 nucleotide neighbors, or 1 genome link ) 



Z99115 

Bacillus subtilis complete genome (section 12 of 21): from 2195541 to 
2409220 

gil2634478iemblZ99115IBSUB0012 [2634478] ' 

(View GenBank report,FASTA report,ASN.l report,Graphical view ,244 

protein links, 64 nucleotide neighbors, or 1 genome link ) 



Z99110 

Bacillus subtilis complete genome (section 7 of 21): from 1 194391 to 
1411140 

gil2633472lemblZ99110IBSUB0007 [2633472] 

(View GenBank report,FASTA report ASN.l report,Graphical view,226 

protein links, 31 nucleotide neighbors, or 1 genome link ) 



X78410 

Bacteriophage phiadh holin and lysin genes 
gil793848lemblX78410ILGHOLLYS [793848] 

(View GenBank report,FASTA report ASN.l report,Graphical view.l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 



Z93946 
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Bacteriophage Dp-1 dph and pal genes and 5 open reading frames 

gil 1934760!emblZ93946IBPDPlORFS [ 1934760] 

(View GenBankreport,FASTA report ASN.l report,Graphical view, or 6 

protein links ) 



AF011378 

Bacteriophage ski complete genome 
gil2392824lgblAF011378IAF011378 [2392824] 

(View GenBank report,FASTA reportASN.l report.Graphical view ,54 protein 
links, 2 nucleotide neighbors, or 1 genome link ) 



Z47794 

Bacteriophage Cp-1 DNA, complete genome 
gil2288892lemb!Z47794IBPCPlXX [2288892] 

(View GenBank report,FASTA reportASN.l report,Graphical view,3 MEDLINE 
links, 28 protein links, 1 nucleotide neighbor, or 1 genome link ) 



L35561 

Bacteriophage phi- 1 05 ORFs 1-3 
gil532218lgb!L35561!PH50RFHTR [532218] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, or 3 protein links ) 



D49712 

Bacillus licheniformis DNA for ORFs, xpaL2 homologous protein and xpaLl 
homologous protein, complete and partial cds 
gill514423!dbjlD49712ID49712 [1514423] 

(View GenBank report,FASTA report ASN.l report,Graphical view,2 MEDLINE 
links, or 4 protein links ) 



X90511 

Lactobacillus bacteriophage phigle DNA for Rorfl62, Holin, Lysin, and 
Rorf 175 genes 

gil 1926386lemblX9051 HLBPfflHOL [ 1926386] 

(View GenBank reportJFASTA reportASN.l report,Graphical view ,4 protein 
links, or 1 nucleotide neighbor ) 



X98106 

Lactobacillus bacteriophage phigle complete genomic DNA 
gill926320lemblX98106ILBPHIGlE [1926320] 

(View GenBank report,FASTA report,ASN.l report,Graphical view.l MEDLINE 
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link, 50 protein links, or 4 nucleotide neighbors ) 



U72397 

Bacteriophage 80 alpha holin and amidase genes, complete cds 
gil 1763241lgblU72397IB8U72397 [ 1763241] 

(View GenBank report,FASTA report ASN.l report,Graphical view,2 protein 
links, or 2 nucleotide neighbors ) 



U38906 

Bacteriophage rlt integrase, repressor protein (rro), dUTPase, holin and 

lysin genes, complete cds 

gil 1353517lgblU38906IBRU38906 [ 1353517] 

(View GenBank report,FASTA report ASN.l report,Graphical view,2 MEDLINE 
links, 50 protein links, or 3 nucleotide neighbors ) 



X91149 

Bacteriophage phi-C3 1 DNA cos region 

gil 1 107473lemblX91 149IAPHIC31C [1 107473] 

(View GenBank report,FASTA reportASN.l report.Graphical view,l MEDLINE 
link, 6 protein links, or 1 nucleotide neighbor ) 



U24159 

Bacteriophage HP1 strain HPlcl, complete genome 
gill046235lgblU24159IBHU24159 [1046235] 

(View GenBank report,FASTA reportASN.l report,Graphical view,6 MEDLINE 
links, 41 protein links, 8 nucleotide neighbors, or 1 genome link ) 



Z26590 

Bacteriophage mv4 lysA and lysB genes 
gil410500lemblZ26590IMV4LYSAB [410500] 

(View GenBank report,FASTA reportASN.l report,Graphical view, or 4 
protein links ) 



Z70177 

B.subtilis DNA (28 kb PBSX/skin element region) 
gill225934lemblZ70177IBSPBSXSE [1225934] 

(View GenBank reportJFASTA reportASN.l report,Graphical view32 protein 
links, or 4 nucleotide neighbors ) 



Z36941 
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B.subtilis defective prophage PBSX xhlA, xhlB, and xylA genes 
gil535793lemblZ36941!BSPBSXXHL [535793] 

(View GenBank report,FASTA report ASN.l report.Graphical view,4 protein 
links, or 5 nucleotide neighbors ) 



X89234 

L.innocua DNA for phagelysin and holin gene 
gilll34844lemblX89234ILICPLYHOL [1134844] 

(View GenBank report.FASTA report,ASN.l report.Graphical view,l MEDLINE 
link, 2 protein links, or 4 nucleotide neighbors ) 



X85010 

Bacteriophage A51 1 ply51 1 gene 
gi!853748lemblX85010IBPA51 1PLY [853748] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view.l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 



X85009 

Bacteriophage A500 hol500 and ply500 genes 
gi!853744lemblX85009IBPA500PLY [853744] 

(View GenBank report,FASTA report ASN.l report,Graphical view.l MEDLINE 
link, 3 protein links, or 4 nucleotide neighbors ) 



X85008 

Bacteriophage Al 18 holl 18 and plyl 18 genes 
gil853740lemblX85008IBPA118PLY [853740] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view.l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 



L34781 

Bacteriophage phi 11 holin homologue (ORF3) gene, complete cds and 

peptidoglycan hydrolase (lytA) gene, partial cds 

gil51 1838lgblL34781IBPHHOLIN [51 1838] . 

(View GenBank report,FASTA reportASN.l report,Graphical view.l MEDLINE 

link, 4 protein links, or 2 nucleotide neighbors ) 



U11698 



Serratia marcescens SM6 extracellular secretory protein (nucE), putative 
phage lysozyme (nucD), and transcriptional activator (nucC) genes, 
complete cds 

gil509550lgblUl 1698ISMU1 1698 [509550] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
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link, 3 protein links, or 1 nucleotide neighbor ) 



U31763 

Serratia marcescens phage-holin analog protein (regA), putative phage 
lysozyme (regB), and transcriptional activator (regC) genes, complete 
cds 

gil965068lgblU31763ISMU31763 [965068] 

(View GenBank report.FASTA report ASN.l report.Graphical view.l MEDLINE 
link, 3 protein links, or 1 nucleotide neighbor ) 

X87674 

Bacteriophage PI lydA & lydB genes 
gil974763lemblX87674IBACPlLYD [974763] 

(View GenBank report,FASTA report ASN.l report,Graphical view.l MEDLINE 
link, 2 protein links, or 2 nucleotide neighbors ) 



L48605 

Bacteriophage c2 complete genome 
gilll46276lgblL48605IC2PVCG [1146276] 

(View GenBank report,FASTA report ASN.l report,Graphical view3 MEDLINE 
links, 39 protein links, 3 nucleotide neighbors, or 1 genome link ) 



L33769 

Bacteriophage bIL67 DNA polymerase subunit (ORF3-5), essential 
recombination protein (ORF13), lysin (ORF24), minor tail protein 
(ORF31), terminase subunit (ORF32), holin (ORF37), unknown protein (ORF 
1-2,6-12,14-23,25-3033-36), complete genome 
gil522252lgblL33769IL67CG [522252] 

(View GenBank report,FASTA report ASN.l report,Graphical view.l MEDLINE 
link, 37 protein links, 2 nucleotide neighbors, or 1 genome link ) 



L31348 

Bacteriophage Tuc2009 integrase (int) gene, complete cds; lysin (lys) 
gene, 3' end 

gil508612lgblL31348ITU2INT [508612] 

(View GenBank reportJFASTA report ASN.l report,Graphical view,2 MEDLINE 
links, 3 protein links, or 3 nucleotide neighbors ) 



L31364 

Bacteriophage Tuc2009 holin (S) gene, complete cds; lysin (lys) gene, 
complete cds 

gil496281lgblL31364ITU2SLYS [496281] 
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(View GenBank reportJFASTA report,ASN.l report,Graphical view,l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 



L31366 

Bacteriophage Tuc2009 structural protein (mp2) gene, complete cds 
gil496278lgblL31366ITU2MP2A [496278] 

(View GenBank report,FASTA report,ASN.l reporf.Graphical view.l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 



L31365 

Bacteriophage Tuc2009 structural protein (mpl) gene, complete cds 
gil496276lgblL31365ITU2MPlA [496276] 

(View GenBank report,FASTA report ASN.l report.Graphical view.l MEDLINE 
link, or 1 protein link ) 



U04309 

Bacteriophage phi-LC3 putative holin (lysA) gene and putative murein 
hydrolase (lysB) gene, complete cds 
gil530796lgblU04309IBPU04309 [530796] 

(View GenBank report,FASTA report ,ASN.l report,Graphical view.l MEDLINE 
link, 2 protein links, or 1 nucleotide neighbor ) 
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Table 14 



NCBI Entrez Nucleotide QUERY 
Key word: bacteriophage and kil 
5 citations found (all selected) 



AF034975 

Bacteriophage H-19B essential recombination function protein (erf), kil 
protein (kil), regulatory protein cIII (cIII), protein gpl7 (17), N 
protein (N), cl protein (cl), cro protein (cro), ell protein (ell), 0 
protein (O), P protein (P), ren protein (ren), Roi (roi), Q protein (Q), 
Shiga-like toxin A (slt-IA) and B (slt-IB) subunits, and putative holin 
protein (S) genes, complete cds 
gil2668751lgb!AF034975l [2668751] 

(View GenBank report,FASTA report ASN.l report,Graphical view,l MEDLINE 
link, 20 protein links, or 30 nucleotide neighbors ) 

X 15637 

Bacteriophage P22 P(L) operon encompassing ral, 17, kil and arf genes 
gill5646lemblX15637iPOP22PL [15646] 

(View GenBank report,FASTA report ,ASN,1 report,Graphical view,l MEDLINE 
link, 7 protein links, or 2 nucleotide neighbors ) 



J02459 

Bacteriophage lambda, complete genome 
gi!215104!gblJ02459!LAMCG [215104] 

(View GenBank report,FASTA report^SN.l report,Graphical view,87 MEDLINE 
links, 67 protein links, 190 nucleotide neighbors, or 1 genome link ) 



M64097 

Bacteriophage Mu left end 
gi!215543igblM64097IPMULEFTEN [215543] 

(View GenBank report,FASTA report,ASN.l report,Graphical view,2 MEDLINE 
links, 39 protein links, or 15 nucleotide neighbors ) 



M18902 

Bacteriophage D108 kil gene encoding a replication protein, 3 r end; and 
containing three ORFs, complete cds 
gitl66191lgblM18902!D18KIL [166191] 

(View GenBank report,FASTA report,ASRl report,Graphical view,l MEDLINE 
link, 1 protein link, or 3 nucleotide neighbors ) 



Xhol 



BamHI 




Digestion with 
BamHI & Hindill 



HA tag 



BamHI SaK HiwMl | 
GATCCCGGTCGACCAAGCTTYACCCATA CXjACGTCCC AGACTACGCC AGC TGA 

GGCCA G CTGGTTCGAA ATGGGTATGC TGC AGGG TCT G ATGC GGTC GA CTTCGA 

HiruM 



Hindin 



Ligation 

T 




ilg. 1A 




Ligation 



Xhol 



Modified between stop 
of arsR to BamHI 



Bam HI 




PCRof pT0021 with LucFFB & LucFFH 

LucFFB - 5'- CGGGATCCATGAGGGGTTCCGAAGACG 
Start of Original BamHI 
BamW LucFf was mo dified 



Hindni 



Digestion with 
BamHl&Hindlll 



GAAAGTCCAAATrGT/14GCT7GGG- LucFFH 
$topof Hindni 
LucFF 



T 



Ligation 



Xhol 



BamHI 



Modified in the 
vicinity of BamHI 

Cloning site for ORFs: 
BamHI & HindlU 

No additional codons 
in the induced protein 



P arsR 

CTCGAG- lATG-l - jfGAb AAAAGGAGGCGGATCC fcTG-| - 

Xho\ RBS BamHI 




HindlU 



LucFF 



- tMCt CTT 
HindlU 



Fig. IB 
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Verification of pTHA/ORF clones 
by PCR and sequencing 



Fig. 2 



Fig. 3 



A) Functional assay on semi-solid support media 

Frozen stock of phage 77 pTHA/ORF 5. aureus RN4220 transformants 



O/N, 37°C 

Compare bacterial growth on plates with and without NaAsC>2 



B) Functional assay in liquid medium 



O/N culture inoculated from frozen stock of 

phage 77 pTHA/ORF X aureus RN4220 transformants 



1:100 dilution of O/N culture 

| 2 h, 37°C, 250 rpm 

Fresh culture 
| 150 |J 

2.5 ml containing 0 and 5 \iM NaAs0 2 
| 3.5h,37°C,250rpm 



1:10 and 1: 100 dilution in saline solution 




Streak onto agar plates containing 
0, 2.5, 5, and 7.5^MNaAs02 



Spot onto agar plates containing 
0, 2.5, 5, and 7.5 jiM NaAs02 





Measure OD555 



1:10 serial dilution from 10" 1 to 10"° 
I 20^1ofl0- 4 to 

Spot onto agar plate 

I OZN,37°C 

Count colonies 



A. Inhibition of bacterial growth with individual ORFs of a S. aureus bacteriophage 




Fig. 4A. 



Fig. 4B 



B. Inhibition of bacterial growth with individual ORFs of a S. aureus bacteriophage. 
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ORF number 



Fig. 4C 



C. Inhibition of bacterial growth with individual ORFs of a S. aureus bacteriophage. 
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